DESCRIPTION

IMPROVED METHODS AND MATERIALS FOR TRANSFORMATION

This invention was made with United States government support awarded by the 
following agency: 

NIH Grant No: GM35072 
The United States has certain rights in this invention. 

Background of the Invention 
RNA viruses have been found to be valuable tools in the phenotypic and genotypic 
transformation of targeted cells and tissues. See, e.g., U.S. Patent No. 5,500,360, which teaches 
novel viral RNA expression vectors. It has been shown that the RNA of the genome of an RNA 
virus can be modified to include an exogenous RNA segment and that the modified RNA can 
be introduced into a host cell, replicated therein, and thereby express the exogenous RNA 
segment.

Current methods of inoculating a host cell with modified RNA viruses involve the in 
vitro transcription of a particular strand followed by the introduction of the resulting RNA 
transcripts into the host cell. One problem with the current inoculation method is that the RNA
rapidly degrades, which lowers the efficiency of infection. In addition, the preparation of the
in vitro RNA transcripts is expensive and time-consuming.

Further, with the advent of transformation and the genetic engineering of plants, considerable
concern has arisen about the potential hazard of the dispersal of dangerous traits into the
environment. For example, genes increasing the stress tolerance and/or herbicide resistance of 
an agriculturally important crop could theoretically "leak" to surrounding less desirable and 
damaging plants, e.g., through pollen, mechanical or insect dispersal. This phenomenon could 
create a novel species of "super-weed" which could wreak havoc on the agricultural industry. 
Existing RNA virus-based vectors can spread to non-target plants by mechanical means and/or 
by insects. Such spread can be prevented by using vectors that can replicate and/or move only 
in target plants expressing the appropriate trans-acting factors. Accordingly, there remains a 
need for less expensive and more efficient methods of transformation of target cells and tissues. 
Moreover, there is a need for a novel method of transformation which alleviates the potential 
dangers associated with the unwanted spread of engineered traits into the environment. 
Brief Summary of the Invention
The subject invention pertains to improved materials and methods for transforming host 
cells which involve transfecting said cells with a DNA-launching platform. One aspect of the 
subject invention pertains to a DNA-launching platform which encodes a modified viral RNA
molecule downstream of a DNA-dependent RNA polymerase (pol) promoter, whereby the DNA-
launching platform is capable of being introduced into a host cell and effectively "launching"
said modified viral RNA molecule into the host cell such that it is replicated and expressed
therein. The term "modified viral RNA molecule" as used herein refers to a viral RNA which
has been changed from its natural state. Examples of changes to viral RNA include, but are not
limited to, removal of a part of the viral RNA genome and insertion or substitution of an
exogenous RNA. The exogenous RNA segment can be located in a region of the viral RNA molecule
such that it does not disrupt RNA replication. Techniques for such manipulations have been
well known to those of ordinary skill in the art for many years. Preferably, the modified viral
RNA molecule further comprises a ribozyme which is located in the proximity of the 3' end of
the modified viral RNA molecule. The viral segment may have the ability to be replicated with
or, alternatively, without the presence of trans-acting viral replicating elements.

Another aspect of the subject invention pertains to a method of genotypically or 
phenotypically modifying a host cell, comprising introducing a DNA-launching platform which 
encodes a viral RNA molecule and an exogenous RNA segment in a location which does not 
disrupt the replication of said viral RNA segment or said exogenous RNA segment, whereby the 
exogenous RNA segment confers a detectable trait in the host cell. The subject invention applies 
to a wide array of plant cells. 

Still a further aspect of the subject invention pertains to cells in which the DNA- 
launching platform of the subject invention has been introduced. 

Yet another aspect of the subject invention pertains to a plant comprising cells 
transfected with the DNA-launching platform. 

The novel methods and materials of the subject invention provide a greater inoculation
efficiency of RNA viruses because the DNA-launching platforms of the subject invention are
more resistant to degradation than RNA inocula, and because each DNA platform produces
multiple RNA transcripts over an extended period of time. As the DNA-launching platform
provides a genetically stable in planta archive copy of a desired vector construct, the continuing 
transcription of said DNA platform will repeatedly reinoculate the host cell with the desired 
construct. This serves to counteract genetic instability problems that have inhibited the 
expression of some genes from vectors based on plant and animal RNA viruses. Further, the 



WO 99/61597 PCT/US99/11250


inoculation methods of the subject invention provide a much simpler means of producing inocula 
in bulk for large scale use, which is cheaper and more efficient than inoculating with in vitro 
RNA transcripts. 



Brief Description of the Drawings

Figure 1 represents the schematic for producing the 1a and 2a proteins in the host cell.

Figure 2 illustrates an example of an Agrobacterium transformation vector containing
an expression cassette capable of expressing 1a and/or 2a BMV proteins.

Figure 3 illustrates several Agrobacterium vectors that were produced to transform host
plant cells (black rectangles indicate T-DNA borders).

Figure 4 represents the general mechanism of BMV RNA3 launching and replication.

Figure 5 depicts DNA-launching platforms which can be used in accord with the
teachings contained herein. The BMV and CCMV designations denote cis-acting elements.

Figure 6 depicts DNA-launching platforms which can be used in accord with the
teachings contained herein.

Figure 7 depicts DNA-launching platforms which can be used in accord with the
teachings contained herein.

Figure 8 depicts DNA-launching platforms which can be used in accord with the
teachings contained herein.

Figure 9 depicts an Agrobacterium vector for delivery of DNA-launching platforms to
plant cells (open triangles represent T-DNA borders).

Figure 10 depicts DNA-launching platforms which can be used in accord with the
teachings contained herein.



Legend For Figures 5-10:

35S = CaMV35S promoter

t = termination/polyA+ sequences

Rz = ribozyme

NOS = NOS promoter

OOA = origin of assembly

FG = foreign gene



Figure 11 shows that BMV replication factors support efficient RNA3 replication in
protoplasts.

Figure 12 shows the efficient replication of launched BMV RNA3 in protoplasts.

Figure 13 shows transgenic expression of BMV 1a and 2a mRNAs in N. tabacum and
N. benthamiana.

Figure 14 shows the efficient replication of launched BMV RNA3 in
(1a + 2a)-transgenic plants.

Figure 15 shows the successful GUS expression from the launched BMV RNA3 in
(1a + 2a)-transgenic plants.

Figure 16 shows the successful GUS expression from the launched BMV RNA3 in
protoplasts.

Figure 17 shows the successful GFP expression from the launched BMV RNA3 in
(1a + 2a)-transgenic plants.

Figure 18 shows the successful GFP expression from the launched BMV RNA3 in
protoplasts.

Figure 19 shows the efficient replication of the launched BMV RNA3 in
(1a + 2a)-transgenic N. benthamiana using Agrobacterium inoculation.

Figure 20 shows the successful GUS expression from the launched BMV RNA3 having
the SHMV coat protein in (1a + 2a)-transgenic plants.

Figure 21 shows that launched BMV replicates, moves cell-to-cell, and spreads long
distances in (1a + 2a)-transgenic plants.

Figure 22 shows transfection of progeny from (1a + 2a)-transgenic N. benthamiana with
BMV RNA3 DNA-launching platform and localization of the launched RNA3 to the roots.

Brief Description of the Sequences 
SEQ ID NO. 1: pB1LR2 - partial nucleotide sequence includes BMV 1a expression
cassette.

SEQ ID NO. 2: pB1LR3 - partial nucleotide sequence includes BMV 1a expression
cassette.

SEQ ID NO. 3: pB2LR4 - partial nucleotide sequence includes BMV 2a expression
cassette.

SEQ ID NO. 4: pB2LR5 - partial nucleotide sequence includes BMV 2a expression
cassette.

SEQ ID NO. 5: pB12LR6 - partial nucleotide sequence includes BMV 1a and 2a
expression cassettes.

SEQ ID NO. 6: pB12LR7 - partial nucleotide sequence includes BMV 1a and 2a
expression cassettes.

SEQ ID NO. 7: pB12LR8 - partial nucleotide sequence includes BMV 1a and 2a
expression cassettes.

SEQ ID NO. 8: pB12LR9 - partial nucleotide sequence includes BMV 1a and 2a
expression cassettes.

Detailed Disclosure of the Invention
To facilitate understanding of the invention, certain terms used throughout are herein
defined. The term "RNA virus" as used herein means a virus whose genome is RNA in a
double-stranded or single-stranded form, the single strand being a (+) strand or (-) strand.

The terms "transfection" or "transfected" as used herein mean an introduction of a
foreign DNA or RNA into a cell by mechanical inoculation, electroporation, agroinfection,
particle bombardment, microinjection, or by other known methods.

The terms "transformation" or "transformed" as used herein mean a stable
incorporation of a foreign DNA or RNA into the cell which results in a permanent, heritable
alteration in the cell. Accordingly, the skilled artisan would understand that transfection of a cell
may result in the transformation of that cell.

The term "launched" as used herein refers to a polynucleotide that has been transcribed
from a DNA-launching platform, as described herein, and, preferably, replicated.

The term "cis-acting element" as used herein denotes that portion of the RNA genome
of an RNA virus which must be present in cis, that is, present as a part of each viral strand as a
necessary condition for replication of that strand. Virus replication may depend upon the
existence of one or more trans (diffusible) elements which interact with the cis-acting element
to carry out RNA replication. If trans-acting elements are necessary for replication, they need
not be present or coded for on the modified viral RNA provided, but may be made available
within the infected cell by some other means. For example, the trans-acting replication functions
may be provided by other, unmodified or modified, components of the viral genome transfected
into the cells simultaneously with the modified RNA. The same approach can be used for other
trans-acting functions including movement protein, coat protein, and other functions. The target
cell may also be premodified; for example, cells may have been previously transformed to
provide constitutive expression of the trans-acting functions from a chromosome. The cis-acting
element is composed of one or more segments of viral RNA which must be present on any RNA
molecule that is to be replicated within a host cell by RNA replication. The segment will most
likely be the 5' and 3' terminal portions of the viral RNA molecule, and may include other
portions and/or virus open reading frames as well. The cis-acting element is accordingly defined
in functional terms: any modification which destroys the ability of the RNA to replicate in a cell
known to contain the requisite trans-acting elements is deemed to be a modification in the
cis-acting element. Conversely, any modification, such as deletion or insertion in a sequence region
which is able to tolerate such deletion or insertion without disrupting replication, is a
modification outside the cis-acting element. As is demonstrated herein using the example of
BMV, which is known and accepted by those skilled in the art to be a functional example,
substantial portions of an RNA virus molecule may be modified by deletion, insertion,
or a combination of deletion and insertion, without disrupting replication.

"Exogenous RNA" is a term used to describe a segment or component of RNA to be
inserted into the virus RNA to be modified, the source of the exogenous RNA segment being
different from the RNA virus itself. The source may be another virus, or an organism such as a
plant, animal, bacterium, or fungus. The exogenous RNA may be a chemically synthesized
RNA, derived from a native RNA, or it may be a combination of the foregoing. The exogenous
RNA may provide any function which is appropriate and known to be provided by an RNA
segment. Such functions include, but are not limited to, a coding function in which the RNA
acts as a messenger RNA encoding a sequence which, when translated by the host cell, results
in synthesis of a peptide or protein having useful or desired properties; the RNA segment may
also be structural, as for example in ribosomal RNA; it may be regulatory, as for example with
small nuclear RNAs or anti-sense RNA; or it may be catalytic. One skilled in the art will
understand that the exogenous RNA may encode, for example, a protein which is a key enzyme
in a biochemical pathway, which upon expression effects a desirable phenotypic characteristic,
such as altering cell metabolism. Further, the exogenous RNA may encode a protein involved
in transcriptional regulation, such as zinc finger, winged-helix, and leucine-zipper proteins. A
particularly interesting function is provided by anti-sense RNA, sometimes termed (-) strand
RNA, which is in fact a sequence complementary to another RNA sequence present in the target
cell which can, through complementary base pairing, bind to and inhibit the function of the RNA
in the target cell.
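
The complementarity that underlies anti-sense inhibition can be illustrated with a short sketch. The sequences and function names below are hypothetical and are not taken from any construct described herein; the sketch assumes perfect Watson-Crick pairing only and ignores secondary structure and G-U wobble pairs.

```python
# Hypothetical illustration of anti-sense complementarity (not from the disclosure).
COMPLEMENT = str.maketrans("ACGU", "UGCA")


def antisense(rna: str) -> str:
    """Return the reverse complement of an RNA sequence, written 5' to 3'."""
    rna = rna.upper()
    if set(rna) - set("ACGU"):
        raise ValueError("sequence must contain only A, C, G, U")
    return rna.translate(COMPLEMENT)[::-1]


def pairs_with(anti: str, target: str) -> bool:
    """True if `anti` is perfectly complementary to some stretch of `target`."""
    return antisense(anti) in target.upper()


# Assumed example sequences, chosen only for illustration.
target_rna = "GGAUGCCAUUGA"
anti = antisense("AUGCCAUUG")  # anti-sense to an internal stretch of target_rna
```

Here `pairs_with(anti, target_rna)` evaluates to true because the anti-sense strand was derived from a stretch present in the target.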

The term "non-viral" is used herein in a special sense to include any RNA segment
which is not normally contained within the virus whose modification is exploited for replication
and expression, and is therefore used synonymously with "exogenous". Accordingly, a gene
derived from a different virus species than that which is modified is included within the meaning
of the terms "non-viral" and "exogenous" for the purposes of describing the invention. For
example, a non-viral gene as the term is used herein could include a gene derived from a
bacterial virus, an animal virus, or a plant virus of a type distinguishable from the virus modified
to effect transformation. In addition, a non-viral gene may be a structural gene derived from any
prokaryotic or eukaryotic organism.

In one embodiment, the subject invention concerns a novel method of transfecting a host
cell which uses a DNA-launching platform to introduce viral RNA into the cell. The subject
invention is directed towards a method of transfection employing a DNA-launching platform
which encodes a modified viral RNA molecule comprising an RNA viral component attached
to an exogenous RNA component and a DNA-dependent RNA pol promoter. The DNA-dependent
RNA pol promoter is preferably but not necessarily fused within up to 10 nucleotides
of the 5' transcriptional start site of the modified viral RNA molecule, and more preferably
within up to 5 nucleotides of the 5' transcriptional start site. Expression of the DNA-launching
platform produces transcripts of the modified viral RNA molecule that are then capable of RNA
replication in the presence of replication factors, which can be present in the modified viral RNA
and/or may be supplied in trans by other means, including expression from a chromosome or
supply on different launching plasmids. When the modified viral RNA is replicated, the
exogenous RNA can be replicated as well. Further, the exogenous RNA can be expressed in the
cell, thereby providing a predetermined phenotypic characteristic. In a preferred embodiment,
the DNA-launching platform further comprises a nucleotide sequence encoding a self-cleavable
ribozyme situated proximate to the 3' end of said RNA molecule. As would be readily apparent
to those skilled in the art, known ribozymes may be used in accordance with the subject
invention. In a preferred embodiment, the ribozyme cleaves the modified RNA viral molecule
at the 3' region. The 3' region can consist of up to 30 nucleotides upstream or downstream of
the 3' end, and preferably consists of up to 10 nucleotides upstream or downstream of the 3' end.
In a more preferred embodiment, the ribozyme cleaves the modified RNA viral molecule
precisely at the 3' end. Other known regulatory sequences, e.g., promoters and/or termination
sequences, may also be substituted for and/or included on the DNA-launching platform. A
suitable restriction site can be introduced proximate to the 3' end of the modified viral RNA
molecule sequence and the DNA molecule can be cleaved by an appropriate restriction enzyme
prior to transfection. The term "DNA-launching platform" as used herein is intended to mean
a DNA molecule, circular or linear, which has a coding region comprising a segment encoding
a modified viral RNA segment, and further, which is capable of being delivered into a cell and
subsequently transcribed.
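
The nucleotide ranges recited above for promoter fusion and for ribozyme cleavage can be summarized in a minimal sketch. The function names, tier labels, and the signed-offset convention are assumptions introduced for illustration; only the numeric thresholds (5 and 10 nucleotides at the 5' end; 0, 10, and 30 nucleotides at the 3' end) come from the text.

```python
# Hypothetical helpers; only the numeric thresholds come from the description.

def promoter_fusion_tier(gap_nt: int) -> str:
    """Classify the gap (nt) between the promoter fusion point and the
    5' transcriptional start site of the modified viral RNA."""
    if gap_nt < 0:
        raise ValueError("gap must be non-negative")
    if gap_nt <= 5:
        return "more preferred"
    if gap_nt <= 10:
        return "preferred"
    # The text says "preferably but not necessarily", so larger gaps are allowed.
    return "permitted but not preferred"


def ribozyme_cleavage_tier(offset_nt: int) -> str:
    """Classify the ribozyme cleavage position relative to the viral 3' end.

    offset_nt is an assumed signed convention: negative = upstream,
    positive = downstream, 0 = precisely at the 3' end.
    """
    distance = abs(offset_nt)
    if distance == 0:
        return "more preferred"
    if distance <= 10:
        return "preferred"
    if distance <= 30:
        return "within 3' region"
    return "outside 3' region"
```

For example, a promoter fused 3 nucleotides from the start site falls in the "more preferred" tier, while a ribozyme cleaving 7 nucleotides upstream of the 3' end falls in the "preferred" tier.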

Possible regulatory sequences can include, but are not limited to, any promoter already
shown to be constitutive for expression, such as those of viral origin (CaMV 19S and 35S) or
so-called "housekeeping" genes (ubiquitin, actin, tubulin) with their corresponding
termination/polyA+ sequences. Also, seed- and/or developmentally-specific promoters, such
as those from plant fatty acid/lipid biosynthesis genes (ACPs, acyltransferases, desaturases, lipid
transfer protein genes) or from storage protein genes (zein, napin, cruciferin, conglycinin,
phaseolin, or lectin genes, for example), with their corresponding termination/polyA+ sequences
can be used for targeted expression. In addition, the gene can be placed under the regulation of
inducible promoters and their termination sequences so that gene expression is induced by light
(rbcS-3A, cab-1), heat (hsp gene promoters) or wounding (mannopine, HGPGs). It is clear to
one skilled in the art that a promoter may be used either in native or truncated form, and may be
paired with its own or a heterologous termination/polyA+ sequence.

In a particularly preferred embodiment, the subject invention is directed toward a
method of genotypically or phenotypically modifying a cell comprising the following steps: a)
forming a cDNA molecule of a virus RNA, or of at least one RNA component if the RNA virus
is multipartite, the viral RNA having been modified to contain a DNA segment encoding a
non-viral RNA component situated in a region able to tolerate such insertion without disrupting
replication of the RNA product encoded thereby; b) cloning the modified cDNA into a DNA-
launching platform; and c) transfecting a suitable host cell with said DNA-launching platform.
In a most preferred embodiment, the method further comprises pretransforming a plant with
trans-acting viral replication factors and/or other trans-acting factors. Such trans-acting factors
may include viral movement protein(s), coat protein(s), viral protease(s), and other structural
and nonstructural genes. In addition to stable expression of trans-acting factors, trans-acting
factors may be introduced on separate expression plasmids or may be expressed from RNA
transcripts. In a preferred embodiment, such trans-acting factors do not replicate. Suitable host
cells may include protoplasts, cells in suspension, or cells in tissues or whole organisms.

In a specific embodiment intended as an example of the broader teachings herein, the
RNA viral segment can be derived from brome mosaic virus (BMV), whereby the DNA-
launching platform comprises DNA encoding the RNA3 segment of the virus. Brome mosaic
virus (BMV) is a member of the alphavirus-like superfamily of positive-strand RNA viruses of
animals and plants, and has a genome divided among three RNAs. RNA1 and RNA2 encode the
1a and 2a proteins, respectively, which are necessary for genomic RNA replication and
subgenomic mRNA synthesis (see, e.g., U.S. Patent No. 5,500,360, which, to the extent not
inconsistent herewith, is incorporated herein by reference). These proteins contain three
domains conserved in all other members of the alphavirus-like superfamily. 1a (109 kDa) contains
a C-proximal helicase-like domain and an N-proximal domain implicated in RNA capping, and
2a (94 kDa) contains a central polymerase-like domain. See, e.g., French and Ahlquist (1988).
1a and 2a interact with each other and with cell factors to form a membrane-bound viral RNA
replication complex associated with the endoplasmic reticulum of infected cells. BMV RNA3,
a 2.1-kb RNA, encodes the 3a protein (32 kDa) and coat protein (20 kDa), which are involved in
the spread of BMV infection in its natural plant hosts but are dispensable for RNA replication.
See U.S. Patent No. 5,500,360. The 3a or coat protein gene of the RNA3 viral segment can be
replaced with exogenous RNA, whereby it does not interfere with the replication element.
Further, the exogenous RNA segment can be inserted downstream of an additional subgenomic
promoter. Still further, cells or tissues can be pretransformed to express 1a, 2a, 3a, and coat
protein, or any combination thereof, wherein DNA-launching platforms containing a foreign
gene(s) with the necessary cis-acting components are transfected, such that the foreign gene is
replicated and/or expressed.

In one embodiment, the host cell is pretransformed with BMV1 or BMV2 such that it
is transgenically engineered to express 1a and 2a proteins. Preferably, the 5' and 3' ends of
BMV1 and BMV2 are removed such that they are incapable of replication, but can express 1a
and 2a to form a viral RNA replication complex associated with the endoplasmic reticulum of
the host cell. Subsequent transfection of a DNA-launching platform comprising the RNA3 viral
replication segment, as well as the exogenous RNA of interest, can produce the expression of
said exogenous RNA while also preventing the undesired and dangerous spillage of viral RNA
into the environment. That is, because a plant must have all 3 segments to form
infectious BMV particle(s), problems associated with the environmentally hazardous escape of
foreign genes through mechanical or insect dispersal of RNA virus vectors are avoided. One
skilled in the art will readily appreciate that, in the example of BMV, DNA-launching
platforms could also be derived from either RNA1 or RNA2. For example, the sequence
encoding the 1a protein could be replaced with an exogenous RNA; replication would require
the expression of 1a (e.g., from a separate expression plasmid). In a preferred embodiment, the DNA-
launching platform also comprises a ribozyme situated proximate to the 3' end of the modified
RNA3, wherein said ribozyme cleaves the RNA3 at the 3' end. As would be readily apparent
to the skilled artisan with the teachings contained herein, viral segments from other known
viruses, and/or subviral agents, can be used to formulate DNA-launching platforms of the subject
invention. One skilled in the art will appreciate that BMV is merely one representative example
of the many viruses suitable for practicing the subject invention. It is widely accepted that the
principles on which the subject invention is based are broadly applicable to a myriad of viruses.
Examples of other such viruses include, but are not limited to, alfalfa mosaic virus (AMV),
barley stripe mosaic virus, cowpea mosaic virus, cucumber mosaic virus, reoviruses, polio virus,
sindbis virus, vesicular stomatitis virus, influenza virus, retroviruses, and cowpea chlorotic
mottle virus (CCMV), and any other viruses that replicate through RNA intermediates and from
which a cDNA copy can be obtained. Specifically, as the other viruses are further characterized,
those of skill in the art will readily appreciate the applicability of the teachings herein to other
suitable viruses as well.
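
The containment reasoning given above for the BMV example, in which launched RNA3 replicates only where 1a and 2a are supplied and infectious particles require all three genomic segments, can be expressed as a simple set-membership sketch. The names below are hypothetical, and the model is a deliberate simplification that ignores movement, encapsidation, and recombination.

```python
# Hypothetical, simplified model of the BMV containment argument.
REPLICATION_FACTORS = frozenset({"1a", "2a"})
GENOME_SEGMENTS = frozenset({"RNA1", "RNA2", "RNA3"})


def rna3_can_replicate(host_proteins) -> bool:
    """Launched RNA3 replicates only if the host supplies both 1a and 2a."""
    return REPLICATION_FACTORS <= set(host_proteins)


def infectious_virus_possible(replicable_segments) -> bool:
    """Infectious BMV requires all three replicable genomic segments."""
    return GENOME_SEGMENTS <= set(replicable_segments)


# Target plant: expresses 1a and 2a from non-replicating transgenes (5' and 3'
# ends removed), so the only replicable segment present is the launched RNA3.
target_plant = {"proteins": {"1a", "2a"}, "segments": {"RNA3"}}
# Non-target plant that receives escaped RNA3 but expresses no 1a or 2a.
non_target_plant = {"proteins": set(), "segments": {"RNA3"}}
```

Under these assumptions, RNA3 replicates in the target plant but not in the non-target plant, and neither plant holds the full set of replicable segments needed for infectious virus.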

The skilled artisan would easily appreciate that known methods of introducing foreign
DNA into cells can be used in accordance with the teachings of the subject disclosure. Such
methods include, but are not limited to, mechanical inoculation, particle bombardment,
agroinfection, electroporation, and microinjection, as well as other known methods.

Various aspects of the invention can be modified as needed, depending upon specific
characteristics of the virus selected as the transforming and transfecting agent and of the RNA
segment to be inserted. For example, the inserted gene need not be a naturally occurring gene,
but may be modified, a composite of more than one coding segment, or it may encode more than
one protein. The RNA may also be modified by combining insertions and deletions in order to
control the total length or other properties of the modified RNA molecule. The inserted
non-viral gene may be either prokaryotic or eukaryotic in origin. The inserted gene may contain its
own translation start signals, for example, a ribosomal binding site and start (AUG) codon, or
it may be inserted in a manner which takes advantage of one or more of these components
preexisting in the viral RNA to be modified. Certain structural constraints must be observed to
preserve correct translation of the inserted sequence, according to principles well understood in
the art. For example, if it is intended that the exogenous coding segment is to be combined with
an endogenous coding segment, the coding sequence to be inserted must be inserted in reading
frame phase therewith and in the same translational direction.
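
The reading-frame constraint just described can be checked mechanically. The following is a minimal sketch under stated assumptions: sequences are given in the sense orientation (so the "same translational direction" requirement is taken as already satisfied), positions are 0-based indices, and all names and sequences are hypothetical.

```python
# Hypothetical helpers illustrating in-frame insertion into a coding segment.

def codons(seq: str, start: int = 0) -> list:
    """Split seq into complete codons beginning at index start."""
    return [seq[i:i + 3] for i in range(start, len(seq) - 2, 3)]


def insert_in_frame(host: str, orf_start: int, pos: int, insert: str) -> str:
    """Insert `insert` into `host` at index `pos`, preserving the reading frame.

    The insertion point must lie on a codon boundary of the endogenous coding
    segment, and the insert length must be a multiple of three so that the
    downstream endogenous codons stay in phase.
    """
    if pos < orf_start or (pos - orf_start) % 3 != 0:
        raise ValueError("insertion point is not on a codon boundary")
    if len(insert) % 3 != 0:
        raise ValueError("insert length must be a multiple of 3")
    return host[:pos] + insert + host[pos:]


# Assumed toy sequences for illustration only.
fused = insert_in_frame("AUGGCUUAA", orf_start=0, pos=3, insert="GGGCCC")
```

Splitting `fused` with `codons` shows the start codon, the two inserted codons, and the original downstream codons all in the same phase.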

It will be understood by those ordinarily skilled in the art that there may exist certain
genes whose transfer does not result in obvious phenotypic modification of the recipient cell.
Such may occur, for example, if the translation product of the non-viral gene is toxic to the host
cell, is degraded or processed in a manner which renders it non-functional, or possesses structural
features which render it impossible for the host cell to translate in sufficient quantities to confer
a detectable phenotype on the transformed cells. However, the invention does not depend upon
any specific property of an RNA segment or gene being transferred. Therefore, the possible
existence of RNA segments or genes which fail to confer a readily observable phenotypic trait
on recipient cells or plants is irrelevant to the invention, and in any case will be readily
recognizable by those of ordinary skill in the art without undue experimentation.

An exogenous RNA segment may be inserted at any convenient insertion site in any of
the cDNA sequences corresponding to a viral RNA, or component RNA of a multipartite RNA
virus, provided the insertion does not disrupt a sequence essential for replication of the RNA
within the host cell. For example, for a virus whose coat protein is not essential for replication,
an exogenous RNA segment may be inserted within or substituted for the region which normally
codes for coat protein. As desired, regions which contribute to undesirable host cell responses
may be deleted or inactivated, provided such changes do not adversely affect the ability of the
RNA to be replicated in the host cell. For many single component and multipartite RNA viruses,
a reduction in the rate of normal RNA replication is tolerable and will in some instances be
preferred, since the amount of RNA produced in a normal infection is more than enough to
saturate the ribosomes of the transformed cell.

Plant cells which are inoculated in culture will normally remain transfected as the cells
grow and divide since the RNA components expressed from the DNA-launching platform are
able to replicate and thus become distributed to descendant cells upon cell division. Plants
regenerated from phenotypically modified cells, tissues, or protoplasts remain phenotypically
modified. Similarly, plants transfected as seedlings remain transfected during growth. Optimal
timing of application of the transfecting components will be governed by the result which is
intended and by variations in susceptibility to the transfecting components during various stages
of plant growth.

Many plant RNA viruses are seed transmitted from one generation to the next. This
property can be exploited to effect genotypic transformation of a plant. That is to say, the
modified RNA remains transmissible from one generation to the next, just as seed-borne virus
infections are transmitted from one generation to the next.

Following are examples which illustrate procedures for practicing the invention. These 
examples should not be construed as limiting. All percentages are by weight and all solvent 
mixture proportions are by volume unless otherwise noted. 

Example 1 - Construction of Agrobacterium Vectors

Binary vectors for expressing the BMV 1a and 2a proteins in plants were constructed.
Starting with the pBI101.2 construct (Clontech, Palo Alto, CA), the GUS gene was removed by
first cutting the construct with EcoRI and SnaBI. The overhanging restriction fragment ends
were filled in by treatment with Klenow fragment and dNTPs. The restriction fragment ends
were religated, forming pBI101.2LR1.

The 2a expression cassette was inserted into pBI101.2LR1. First, pBI101.2LR1 was
cut with HindIII and dephosphorylated. Next, pB2PA17 (Dinant et al., 1993) was cut with
HindIII and the 2a insert was purified using a low melting agarose gel. The restriction fragment
ends were ligated, forming pB2LR4 and pB2LR5 (Figures 3c and 3d).

The 1a expression cassette was inserted into pBI101.2LR1 by first cutting pBI101.2LR1
with SnaBI and dephosphorylating it. pB1PA17 (Dinant et al., 1993) was cut with PstI and the
extra nucleotides were removed with T4 DNA polymerase. The 1a insert was purified using a
low melting agarose gel. The restriction fragment ends were ligated, forming the pB1LR2 and
pB1LR3 vectors (Figures 3a and 3b).

The 1a expression cassette was inserted into pB2LR4 and pB2LR5 by cutting pB2LR4
or pB2LR5 with SnaBI and dephosphorylating it. pB1PA17 (Dinant et al., 1993) was cut with PstI,
and the extra nucleotides were removed with T4 DNA polymerase. The 1a insert was purified
using a low melting agarose gel and ligated with the cut pB2LR4 or pB2LR5 vectors to form the
pB12LR6, pB12LR7, pB12LR8, and pB12LR9 vectors (Figures 3e-3h).

Example 2 - Construction of DNA-launching Platform for wtRNA3 of BMV and for RNA
Derivatives Containing Foreign Sequences

Vector pRT101 (Topfer et al., 1987) was cut with PpuMI, the restriction fragment
ends were filled in with Klenow fragment and dNTPs, and the vector was then cut with BamHI
and dephosphorylated. Vector pB3RQ39 (Ishikawa et al., 1997) was cut with SnaBI and BamHI;
the B3 fragment was isolated from a low melting agarose gel. This fragment was ligated to the
cut pRT101, thereby forming pB3LR10 (Figure 4). pB3LR15 (Figure 4), a pB3LR10
derivative, has the ClaI-KpnI fragment replaced with the corresponding fragment from pB3TP8
(Janda et al., 1987).

PCR was performed on pRT101 to amplify an EcoRV-EcoRI fragment. To create
a StuI site in place of the PpuMI site, a one-nucleotide deletion was introduced during the PCR
process. The resulting PCR product was cut with EcoRV and EcoRI and inserted into
dephosphorylated pRT101 cut with EcoRV and EcoRI to form pRT101LR11. pRT101LR11
was cut with StuI and BamHI and dephosphorylated. pB3RQ39 was cut with SnaBI and BamHI
and a B3 fragment was isolated using a low melting agarose gel. The fragment was then ligated
to pRT101LR11 to form pB3LR12 (Figure 4).

Another DNA-launching platform was constructed with wtRNA3 of BMV having a
partially doubled CaMV 35S promoter, thereby forming pB3LR14 and pB3LR16 (Figure 4).

A DNA-launching platform wherein the BMV RNA3 coat protein was replaced with
GUS was also constructed. pB3MI22 (Ishikawa et al., 1997) was cut with ClaI and StuI and
a B3GUS insert was isolated. The pB3LR10 or pB3LR14 DNA-launching constructs were cut
with ClaI and StuI and dephosphorylated. The B3GUS fragment was then ligated to the cut
pB3LR10 or pB3LR14, thereby forming the pB3GUSLR17 and pB3GUSLR18 DNA-launching
constructs (Figure 5).

A DNA-launching platform having a BMV RNA3 with a GUS gene insertion wherein
the GUS is downstream of an additional BMV subgenomic promoter was constructed. The
pB3LR15 construct was cut with AvaI and the restriction fragment ends were filled in with
Klenow fragment and dNTPs. The construct was then cut with ClaI and dephosphorylated.
pB3MI22 was cut with ClaI and StuI and a B3GUS fragment was isolated. The isolated B3GUS
fragment was then ligated to the cut pB3LR15 construct to form the new construct
pB3GUSCPLR19 (Figure 5).

A BMV RNA3 based DNA-launching platform with a CP gene inserted downstream of
an additional cowpea chlorotic mottle virus (CCMV) subgenomic promoter was constructed.
The pB3GUSLR17 construct was cut with StuI and KpnI and dephosphorylated. pBC3AJ14
(Pacha and Ahlquist, 1991) was cut with NdeI, the ends were blunted by methods known in the
art, and the plasmid was then cut with KpnI. A coat protein fragment was then isolated. The coat
protein fragment was then ligated to the cut pB3GUSLR17 to form the new construct
pB3GUSCPLR22 (Figure 5).

A DNA-launching platform was constructed having a subgenomic RNA4. pB4MK2
(M. Kroll, personal communications) was cut with SnaBI and BamHI and an RNA4 fragment was
then isolated. The pRT101LR11 construct was cut with StuI and BamHI and dephosphorylated.
The fragment and the cut pRT101LR11 construct were then ligated, forming pB4LR20 (Figure
5a).

A DNA-launching platform wherein the BMV coat protein was replaced with GFP was
constructed. pEGFP (Clontech, CA) was cut with NotI, filled in with Klenow fragment and
dNTPs, and cut with SalI, and the GFP insert was isolated using low-melting agarose gel.
pB3LR15 was cut with SalI and StuI and dephosphorylated. The GFP fragment was then ligated
to the cut pB3LR15, thereby forming pB3GFPLR48 (Figure 6e).

A DNA-launching platform having a BMV RNA3 with a GFP gene insertion wherein
the CP is downstream of an additional CCMV subgenomic promoter was constructed.
pBC3AJ14 (Pacha and Ahlquist, 1991) was cut with NdeI and EcoRI and the ends were blunted
by methods known in the art. The coat protein fragment was then isolated and ligated into
dephosphorylated and blunted pEGFP cut with NotI and StuI, forming pEGFPCPLR49.
pEGFPCPLR49 was cut with KpnI and the EGFPCP fragment was isolated using low-melting
agarose gel. pB3GFPLR48 was cut with KpnI and dephosphorylated. The EGFPCP fragment
was then ligated to the cut pB3GFPLR48, thereby forming pB3GFPCPLR50 (Figure 6a).

An RNA transcription vector wherein the GFP gene is expressed as a translational fusion
with BMV 3a was constructed. pB3TP10 (Pacha and Ahlquist, 1991) was cut with BamHI
and dephosphorylated. The GFP fragment was amplified from pEGFP (Clontech, CA) using
PCR and the following primers:

5' GCAGTCGACGGTACCGCGGGCC 3'
and
5' CGCGGCCGCGGATCCTGTACAGCTCG 3'.

The amplified product was cut with BamHI and purified using low-melting agarose gel. The
GFP fragment was ligated to the cut pB3TP10, forming pB3GFPLR47 (Figure 6d).
pB3GFPLR47 was cut with EcoRI and transcribed using T7 RNA polymerase.
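
As a quick check on the two GFP primers listed above, their sequences can be scanned for
restriction enzyme recognition sites. The Python sketch below is illustrative only: the names
`PRIMER_1`, `PRIMER_2`, `SITES`, and `find_sites` are introduced here, the primer sequences are
read from the text with spacing removed, and only a handful of common palindromic recognition
sequences are checked.

```python
# Recognition sequences of a few common restriction enzymes. All of these
# sites are palindromic, so scanning one strand is sufficient.
SITES = {
    "SalI": "GTCGAC",
    "KpnI": "GGTACC",
    "SacII": "CCGCGG",
    "NotI": "GCGGCCGC",
    "BamHI": "GGATCC",
}

# Primer sequences as given in the text, written 5' to 3'.
PRIMER_1 = "GCAGTCGACGGTACCGCGGGCC"
PRIMER_2 = "CGCGGCCGCGGATCCTGTACAGCTCG"


def find_sites(seq, sites=SITES):
    """Return {enzyme: [0-based start positions]} for each site found in seq."""
    hits = {}
    for enzyme, site in sites.items():
        positions = [
            i for i in range(len(seq) - len(site) + 1) if seq.startswith(site, i)
        ]
        if positions:
            hits[enzyme] = positions
    return hits


for name, primer in (("primer 1", PRIMER_1), ("primer 2", PRIMER_2)):
    print(name, find_sites(primer))
```

Under this scan, only the second primer carries a BamHI site (GGATCC), while the first
carries SalI, KpnI, and SacII sites.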

An Agrobacterium vector containing the BMV RNA3 DNA-launching platform was
constructed. pBI101.2LR1 was cut with SmaI and dephosphorylated. pB3LR15 was
cut with PvuII and the B3 fragment was purified using a low-melting agarose gel. The B3
fragment was then ligated to the cut pBI101.2LR1, thereby forming pB3LR42 (Figure 9).

A DNA-launching platform wherein the BMV RNA3 coat protein was replaced with the
SHMV (Sunn-hemp mosaic virus) coat protein and the GUS gene was inserted downstream of
an additional BMV subgenomic promoter was constructed. pB3RS4 (Sacher et al., 1988)
was cut with AvaI, blunted with Klenow fragment and dNTPs, and cut with KpnI. The SHMV
coat protein fragment was isolated using a low-melting agarose gel. pB3GUSLR17 was cut
with StuI and KpnI and dephosphorylated. The SHMV coat protein fragment was ligated to the
cut pB3GUSLR17, thereby forming pB3GUSCPLR24 (Figure 7).

Other permutations of DNA-launching platforms containing one or more foreign genes
and the necessary cis-acting replication signals will be readily appreciated in view of the
teachings herein. For examples, see Figures 5-10.

Example 3 - Transfection of N. tabacum Protoplasts with DNA-launching Platform
Media: 

NT1 Medium (1 liter) was made with Gibco-BRL MS salt (catalog #11118-031), 3 ml
of 6% KH2PO4, and 0.2 µg/ml 2,4-D (final concentration). The pH was adjusted to 5.5-5.7 using
KOH, and the resulting mixture was autoclaved.

NT1 Plating Medium (1 liter) was made with NT1 medium and 72.86 g mannitol, the 
pH was adjusted to 5.5-5.7, and the resulting mixture was autoclaved. 

Wash Solution (1 liter) was made with 72.86 g mannitol, the pH was adjusted to 5.5, and
the resulting mixture was autoclaved.

Electroporation Buffer was made with 0.8% NaCl, 0.02% KCl, 0.02% KH2PO4, 0.11%
Na2HPO4, and 0.4 M mannitol. The pH was adjusted to 6.5, and the resulting mixture was
autoclaved.

Enzyme Solution was made with 0.4M mannitol, and 20mM MES. The pH was adjusted 
to 5.5, and the resulting mixture was autoclaved. 
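
The recipes above mix mass-per-volume and molar units: 72.86 g of mannitol per liter in the
wash solution corresponds to the 0.4 M mannitol specified for the electroporation buffer and
enzyme solution. The following Python sketch shows the conversion. It is a worked check
rather than part of the protocol; the function names are introduced here, percentages are
assumed to be weight/volume, and the Na2HPO4 value assumes the anhydrous salt, which the
text does not specify.

```python
# Molar masses in g/mol (standard values; Na2HPO4 assumed anhydrous).
MOLAR_MASS = {
    "mannitol": 182.17,
    "NaCl": 58.44,
    "KCl": 74.55,
    "KH2PO4": 136.09,
    "Na2HPO4": 141.96,
}


def grams_per_liter_to_molar(compound, grams_per_liter):
    """Convert a mass concentration (g/l) to molarity (mol/l)."""
    return grams_per_liter / MOLAR_MASS[compound]


def percent_wv_to_millimolar(compound, percent):
    """Convert a weight/volume percentage to mM (1% w/v = 10 g/l)."""
    return percent * 10.0 / MOLAR_MASS[compound] * 1000.0


# Mannitol in the wash solution and plating medium.
print(round(grams_per_liter_to_molar("mannitol", 72.86), 3), "M mannitol")

# Salts of the electroporation buffer, as listed in the text.
for salt, pct in (("NaCl", 0.8), ("KCl", 0.02), ("KH2PO4", 0.02), ("Na2HPO4", 0.11)):
    print(salt, round(percent_wv_to_millimolar(salt, pct), 2), "mM")
```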

Growth conditions: Cells (Nicotiana tabacum) were grown at room temperature in NT1
media with constant shaking (about 200 rpm).

Preparation of cultures for digestion: About 2-3 ml of one-week old suspension culture 
was subcultured into 50 ml of fresh NT1 media 3 days before the enzyme digestion. The culture 
was maintained at 28 °C under constant shaking. 

Enzyme digestion: The enzyme digestion solution was prepared containing the
following: 1% cellulysin (Calbiochem) and 0.3% macerase (Calbiochem) in the enzyme
solution. The pH was adjusted to 5.5 and the solution was filter sterilized.

The cells were centrifuged at 800 rpm for 5 min. The supernatant was discarded. About 
40 ml of wash solution was added, cells were resuspended and were centrifuged at 800 rpm for 
5 min. The supernatant was discarded. The cells were then resuspended in three volumes of 
enzyme digestion solution, and incubated for 60 min. at room temperature. 

Washing: The cells were transferred into a 50 ml plastic tube and centrifuged at 800 rpm
for 5 min. The supernatant was discarded. The cells were resuspended in 40 ml of wash 
solution and centrifuged at 800 rpm for 5 min. The supernatant was discarded. The cells were 
resuspended in 40ml of electroporation buffer and centrifuged at 800 rpm for 5 min. The 
supernatant was discarded. The cells were resuspended in four volumes of electroporation 
buffer. 
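
The centrifugation steps above are given in rpm, which depends on the rotor. The Python
sketch below shows the standard conversion between rpm and relative centrifugal force (RCF).
The rotor radius is not stated in the text, so the 10 cm used here is purely a hypothetical
value for illustration, and the function names are introduced here.

```python
import math

# Standard relation: RCF (x g) = 1.118e-5 * radius_cm * rpm**2
RCF_CONSTANT = 1.118e-5


def rcf_from_rpm(rpm, radius_cm):
    """Relative centrifugal force (x g) for a given speed and rotor radius."""
    return RCF_CONSTANT * radius_cm * rpm ** 2


def rpm_from_rcf(rcf, radius_cm):
    """Rotor speed (rpm) needed to reach a given RCF at a given radius."""
    return math.sqrt(rcf / (RCF_CONSTANT * radius_cm))


# Hypothetical rotor radius; substitute the radius of the rotor actually used.
ASSUMED_RADIUS_CM = 10.0
print(round(rcf_from_rpm(800, ASSUMED_RADIUS_CM), 1), "x g at 800 rpm")
```

With a different rotor, the same 800 rpm gives a proportionally different force, so the
radius should be checked before reproducing the gentle spins used for protoplasts.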

Electroporation: One ml of cells containing the RNA or DNA inocula was transferred
into electroporation cuvettes and placed on ice for 10 min. The cells were then mixed and
electroporated at 500 microF, 250 V. The cuvettes were placed on ice for 10 min. The cells were
transferred into 10 ml of NT1 plating media.

Incubation and collection of samples: The cells were incubated at room temperature in
the dark. Samples were collected 24-48 hrs post inoculation.

RNA Analysis: RNA extraction, denaturing 1% agarose gel electrophoresis and Northern
blot hybridization were performed by known methods, such as that performed in Rasochova and
Miller (1996). Each lane was loaded with equal amounts (approx. 5 µg) of total RNA as
determined by spectrophotometry and confirmed by ethidium bromide staining of ribosomal
RNA before Northern blot hybridization. 1 x 10^6 cpm/ml of radioactive probe in hybridization
buffer was used per hybridization experiment. Replication of RNA3 was confirmed by detection
of sgRNA4, thus showing that BMV RNA replication factors 1a and 2a expressed from
expression plasmid(s) support efficient replication of RNA3 supplied as in vitro transcript
(Figure 11) as well as launched from DNA-launching platform (Figure 12).

Example 4 - Production of Transgenic N. tabacum Plants

Once a desired molecule was constructed in E. coli, the molecule was transferred into
Agrobacterium tumefaciens by the freeze-thaw method. Vectors pB1LR2, pB2LR4, pB12LR6,
and pB12LR7 were all individually used. An Agrobacterium strain LBA4404 containing an
appropriate helper Ti plasmid was grown in 5 ml of YEP medium overnight at 28°C. Two ml
of the overnight culture were added to 50 ml YEP medium in a 250-ml flask and shaken
vigorously (250 rpm) at 28°C until the culture grew to an OD600 of 0.5 to 1.0. The culture was
chilled on ice. The cell suspension was centrifuged at 3000 g for 5 min. at 4°C. The supernatant
solution was discarded. The cells were resuspended in 1 ml of ice-cold 20 mM CaCl2 solution.
0.1-ml aliquots were dispensed into prechilled Eppendorf tubes. About 1 µg of plasmid DNA
was added to the cells. The cells were frozen in liquid nitrogen. The cells were thawed by
incubating the test tube in a 37°C water bath for 5 min. 1 ml of YEP medium was added to the
tube, which was incubated at 28°C for 2-4 h with gentle shaking to allow the bacteria to express
the antibiotic resistance genes. The tubes were centrifuged for 30 s and the supernatant solution
was discarded. The cells were resuspended in 0.1 ml YEP medium, plated on a YEP agar plate
containing selection antibiotic(s), and incubated at 28°C. Transformed colonies appeared in 2-3
days.

In vitro clonal copies of approximately three week old Nicotiana tabacum, Wisconsin No.
38, were used as the source of explants. Leaf explants were prepared from the second and third
fully expanded leaves of in vitro cultures. The leaf pieces were cut into 1 cm x 1 cm squares and
placed upon TB1 (plus 2.0 mg/l 6-benzyl-aminopurine and 0.1 mg/l α-naphthalene acetic acid)
media for 24 hours at 25°C with a 16 hour photoperiod.

Agrobacterium tumefaciens strain LBA4404 containing the preselected binary vector
was used for plant transformation. Explants were placed in ~10 ml of overnight grown
Agrobacterium culture for 30 min. Leaf explants were then blotted on filter paper and placed
on TB2 (plus 1.0 mg/l 6-benzyl-aminopurine and 0.1 mg/l α-naphthalene acetic acid) media for
4 days, abaxial side down. Explants were then rinsed three times in sterile water, blotted on filter
paper, and placed on TB2 media for regeneration with 100 mg/l kanamycin and 400 mg/l
carbenicillin at 25°C, 16 hour photoperiod, abaxial side down. Explants were transferred to
fresh TB2 media with 100 mg/l kanamycin and 400 mg/l carbenicillin every 10 to 14 days until
plantlets developed. Plantlets typically developed at 10-14 days. Plantlets were cut from the
callus and placed on MST media containing 100 mg/l kanamycin and 400 mg/l carbenicillin to
induce rooting. Rooted plants were transferred to soil.

TB1 (1 liter) included 4.30 g MS salts, 100 mg myo-inositol, 1.0 ml Nitsch and Nitsch
vitamins, 30 g sucrose, 2 mg BAP, 0.10 mg NAA, and 8 g Noble agar. The media was
adjusted to pH 5.7 and autoclaved.

TB2 (1 liter) included 4.30 g MS salts, 100 mg myo-inositol, 1.0 ml Nitsch and Nitsch
vitamins, 30 g sucrose, 1.0 mg BAP, 0.10 mg NAA, and 8 g Noble agar. The media was
adjusted to pH 5.7 and autoclaved.

MST (1 liter) included 4.30 g MS salts, 1.0 ml Nitsch and Nitsch vitamins, 30 g sucrose,
100 mg myo-inositol, and 8.5 g Difco agar. The media was adjusted to pH 5.7 and autoclaved.

YEP (100 ml) included 1.0 g Bacto-peptone, 1.0 g Bacto-yeast extract, and 0.5 g NaCl.
The media was autoclaved.

RNA Analysis: Total RNA extraction, denaturing 1% agarose gel electrophoresis and
Northern blot hybridization were performed by known methods, such as that performed in
Rasochova and Miller (1996). Each lane was loaded with equal amounts (approx. 5 µg) of total
RNA as determined by spectrophotometry and confirmed by ethidium bromide staining of
ribosomal RNA before Northern blot hybridization. 1 x 10^6 cpm/ml of radioactive probe in
hybridization buffer was used per hybridization experiment. Figure 13a shows the successful
expression of BMV 1a and 2a mRNA in transgenic N. tabacum.



Example 5 - Transfection of Transgenic N. tabacum Plants with DNA-launching Platform

Precipitation of DNA onto Microcarriers for Particle Bombardment: (Kikkert, 1993).

Sterilization of Microcarriers: 80 mg of gold microcarriers were resuspended in 1 ml
of 70% ethanol, soaked for 15 min., and centrifuged at 13,000 x g for 5 min. The supernatant
was carefully removed and discarded. Particles were resuspended in 1 ml of sterile distilled,
deionized water and centrifuged at 13,000 x g for 5 min. The supernatant was carefully removed
and discarded. Water washing of particles was repeated 2 more times. After the final rinse,
particles were resuspended in 1 ml of sterile 50% glycerol.

Coating Microcarriers with DNA: The following was sequentially and quickly added:
5 µl DNA (1 µg/µl), 50 µl of 2.5 M CaCl2, and 20 µl of 0.1 M spermidine.

The mixture was incubated for 10 min. on a vortex shaker at room temperature.
Particles were pelleted by centrifugation at 13,000 x g for 5 sec. Supernatant was carefully
removed and discarded. Particles were resuspended in 140 µl of 70% ethanol and centrifuged
at 13,000 x g for 5 sec. Supernatant was removed and discarded. Particles were resuspended
in 140 µl of 100% ethanol and centrifuged at 13,000 x g for 5 sec. Supernatant was removed and
discarded. Particles were resuspended in 50 µl of 100% ethanol.

Young leaves from tobacco plants grown in vitro on agar-solidified MS medium
containing 30 g/liter sucrose were bombarded with 5-µl aliquots of resuspended DNA-coated
particles using a PDS-1000/He biolistic gun (DuPont) and 1100 psi rupture disks (Bio-Rad).

RNA Analysis: Total RNA extraction, denaturing 1% agarose gel electrophoresis and
Northern blot hybridization were performed by known methods, such as that performed in
Rasochova and Miller (1996). Each lane was loaded with equal amounts (approx. 5 µg) of total
RNA as determined by spectrophotometry and confirmed by ethidium bromide staining of
ribosomal RNA before Northern blot hybridization. 1 x 10^6 cpm/ml of radioactive probe in
hybridization buffer was used per hybridization experiment. Figure 14a shows that the launched
BMV RNA3 replicates efficiently in transgenic plants expressing BMV replication factors 1a
and 2a and that the launched RNA3 is unable to replicate in the absence of BMV 1a and/or 2a.

Example 6 - Production of Transgenic N. benthamiana Plants

Once a desired molecule was constructed in E. coli, the molecule was transferred into
Agrobacterium tumefaciens. Vectors pB1LR2, pB2LR4, pB12LR6, and pB12LR7 were all
individually used. An Agrobacterium strain LBA4404 containing an appropriate helper Ti
plasmid was grown in 5 ml of YEP medium overnight at 28°C. Two ml of the overnight culture
were added to 50 ml YEP medium in a 250-ml flask and shaken vigorously (250 rpm) at 28°C
until the culture grew to an OD600 of 0.5 to 1.0. The culture was chilled on ice. The cell
suspension was centrifuged at 3000 g for 5 min. at 4°C. The supernatant solution was discarded.
The cells were resuspended in 1 ml of ice-cold 20 mM CaCl2 solution. 0.1-ml aliquots were
dispensed into prechilled Eppendorf tubes. About 1 µg of plasmid DNA was added to the cells.
The cells were frozen in liquid nitrogen. The cells were thawed by incubating the test tube in
a 37°C water bath for 5 min. 1 ml of YEP medium was added to the tube, which was incubated
at 28°C for 2-4 h with gentle shaking to allow the bacteria to express the antibiotic resistance
genes. The tubes were centrifuged for 30 s and the supernatant solution was discarded. The cells
were resuspended in 0.1 ml YEP medium. The cells were plated on a YEP agar plate containing
selection antibiotic(s) and incubated at 28°C. Transformed colonies appeared in 2-3 days.

In vitro clonal copies of approximately five to seven week old N. benthamiana were used
as the source of explants. Leaf explants were prepared from the second and third fully expanded
leaves of in vitro cultures. The leaf pieces were cut into 1 cm x 1 cm squares and placed upon
MS104 media in 100 x 15 mm plates for 24 hours at 23°C with a 16 hour photoperiod.

Agrobacterium tumefaciens strain LBA4404 containing the preselected binary vector
was used. Explants were placed in ~10 ml of overnight grown Agrobacterium culture for 30 min.
Leaf explants were then blotted on filter paper and placed abaxial side down on MS104 media
for 4 days. Explants were then rinsed three times in sterile water, blotted on filter paper, and
placed on MS104 media for regeneration with 300 mg/L kanamycin and 400 mg/L carbenicillin.
Explants were transferred to fresh MS104 media with 300 mg/L kanamycin and 400 mg/L
carbenicillin every 10-14 days until plantlets developed. Plantlets typically developed at 31-50
days. Plantlets were cut from the callus and placed on MST media plus 300 mg/L kanamycin
and 400 mg/L carbenicillin to induce rooting. Rooted plants were transferred to soil.

One liter of MS104 included 4.3 g MS salt mixture, 1.0 ml B5 vitamin solution, 30 g
sucrose, 1.0 mg BA, 0.1 mg NAA, and 8.0 g Phytagar. The media was adjusted to pH 5.8 and
autoclaved.

100 ml of YEP included 1.0 g Bacto-peptone, 1.0 g Bacto-yeast extract, and 0.5 g NaCl. The
media was autoclaved.

One liter of MST included 4.3 g MS salt mixture, 1.0 ml Nitsch & Nitsch vitamins, 30
g sucrose, 100 mg myo-inositol, and 8.5 g Phytagar. The media was adjusted to pH 5.7 and
autoclaved.

RNA Analysis: Total RNA extraction, denaturing 1% agarose gel electrophoresis and
Northern blot hybridization were performed by known methods, such as that performed in
Rasochova and Miller (1996). Each lane was loaded with equal amounts (approx. 5 µg) of total
RNA as determined by spectrophotometry and confirmed by ethidium bromide staining of
ribosomal RNA before Northern blot hybridization. 1 x 10^6 cpm/ml of radioactive probe in
hybridization buffer was used per hybridization experiment. Figure 13b shows the successful
expression of BMV 1a and 2a mRNA in transgenic N. benthamiana.

Example 7 - Transfection of Transgenic N. benthamiana Plants

Precipitation of DNA onto Microcarriers for Particle Bombardment: (From Kikkert
(1993) "The biolistic PDS-1000/He device", Plant Cell Tiss. and Org. Cult. 33:221-226)

Sterilization of Microcarriers: 80 mg of gold microcarriers were resuspended in 1 ml
of 70% ethanol, soaked for 15 min., and centrifuged at 13,000 x g for 5 min. The supernatant
was carefully removed and discarded. Particles were resuspended in 1 ml of sterile distilled,
deionized water and centrifuged at 13,000 x g for 5 min. The supernatant was carefully removed
and discarded. Water washing of particles was repeated 2 more times. After the final rinse,
particles were resuspended in 1 ml of sterile 50% glycerol.

Coating Microcarriers with DNA: To the 50 µl of particles the following was
sequentially and quickly added: 5 µl DNA (1 µg/µl), 50 µl of 2.5 M CaCl2, and 20 µl of 0.1 M
spermidine.

The mixture was incubated for 10 min. on a vortex shaker at room temperature.
Particles were pelleted by centrifugation at 13,000 x g for 5 sec. Supernatant was carefully
removed and discarded. Particles were resuspended in 140 µl of 70% ethanol and centrifuged
at 13,000 x g for 5 sec. Supernatant was removed and discarded. Particles were resuspended
in 140 µl of 100% ethanol and centrifuged at 13,000 x g for 5 sec. Supernatant was removed and
discarded. Particles were resuspended in 50 µl of 100% ethanol.

Young leaves from N. benthamiana plants grown in vitro on agar-solidified MS medium
containing 30 g/liter sucrose were bombarded with 5-µl aliquots of resuspended DNA-coated
particles using a PDS-1000/He biolistic gun (DuPont) and 1100 psi rupture disks (Bio-Rad).
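
The amounts delivered per bombardment follow from the quantities given in this Example:
80 mg of gold in 1 ml of glycerol, 50 µl of particles coated with 5 µl of DNA at 1 µg/µl, a
final resuspension in 50 µl of ethanol, and 5-µl aliquots per shot. The Python sketch below
works through that arithmetic. It is an estimate under stated assumptions (all DNA binds to
the particles and no particles are lost during the washes), and the function name `per_shot`
is introduced here.

```python
def per_shot(
    gold_mg_per_ml=80.0,   # 80 mg gold resuspended in 1 ml of 50% glycerol
    particle_ul=50.0,      # volume of particle suspension that is coated
    dna_ul=5.0,            # volume of DNA added
    dna_ug_per_ul=1.0,     # DNA concentration
    final_ul=50.0,         # final resuspension volume in 100% ethanol
    shot_ul=5.0,           # aliquot used per bombardment
):
    """Estimate shots per preparation and gold/DNA delivered per shot.

    Assumes quantitative DNA binding and no particle loss during washing.
    """
    gold_mg = gold_mg_per_ml * particle_ul / 1000.0
    dna_ug = dna_ul * dna_ug_per_ul
    shots = final_ul / shot_ul
    return {
        "shots": shots,
        "gold_mg_per_shot": gold_mg / shots,
        "dna_ug_per_shot": dna_ug / shots,
    }


print(per_shot())
```

Under these assumptions one coating reaction yields ten shots, each carrying roughly 0.4 mg
of gold and at most 0.5 µg of DNA.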

RNA Analysis: Total RNA extraction, denaturing 1% agarose gel electrophoresis and
Northern blot hybridization were performed by known methods, such as that performed in
Rasochova and Miller (1996). Each lane was loaded with equal amounts (approx. 5 µg) of total
RNA as determined by spectrophotometry and confirmed by ethidium bromide staining of
ribosomal RNA before Northern blot hybridization. 1 x 10^6 cpm/ml of radioactive probe in
hybridization buffer was used per hybridization experiment. The launched BMV RNA3
showed efficient replication (Figure 14b) in transgenic N. benthamiana plants expressing BMV
replication factors 1a and 2a and was unable to replicate in the absence of BMV 1a and/or 2a.

Example 8 - Transfection of Transgenic Plants with GUS Containing DNA-launching Platform

Transgenic N. tabacum and N. benthamiana plants were produced according to the
procedures discussed above. The plants were transfected with a DNA-launching platform
containing a GUS gene (Figure 5a) by particle bombardment as described in Examples 5 and 7.
The plants were incubated for 3-5 days and then assayed for β-glucuronidase (GUS) activity
using 1 mg/ml X-Gluc (5-bromo-4-chloro-3-indolyl glucuronide) as substrate in 0.1 M
potassium phosphate buffer, pH 7.0, 50 µM potassium ferrocyanide, and 2% Triton® X-100.
Following an overnight incubation at 37°C, cells replicating launched RNA3 derivatives and
expressing the GUS reporter gene from a subgenomic RNA4 gave rise to blue spots (Figure 15).
The launched RNA3 derivative did not replicate or express the GUS reporter gene in the absence
of BMV RNA replication factors 1a and 2a (e.g., in wt N. benthamiana and in wt N. tabacum).

Example 9 - Transfection of Transgenic Plants Expressing BMV 1a, 2a, 3a, and CP

A plant is transformed with BMV 1a, 2a, 3a, and CP genes whereby those genes are
stably expressed in said plant. This can be done with the procedures outlined above. Any
modifications that would be needed would be readily apparent to those skilled in the art in light
of the teachings contained herein. A DNA-launching platform encoding an RNA replicon which
contains a foreign gene and the BMV or CCMV cis-acting replication signals necessary to
replicate said replicon is constructed (Figure 10b). Foreign genes to be included in said replicon
could include, for example, a Bacillus thuringiensis polynucleotide that codes for a Bt protein.
Other sequences would include, e.g., sequences that encode herbicide resistance, or any other
known sequence that encodes peptides or proteins having desired qualities in plants.

Alternatively, plants can be transformed to express BMV 1a, 2a, 3a, and a TMV coat
protein in place of the BMV coat protein. A DNA-launching platform is then made containing
one or more foreign genes and the necessary cis-acting replication signals, either BMV or
CCMV, and a TMV origin of assembly (Figures 8a, 8b, and 10a). This launching platform
provides a distinct advantage as TMV is a rod-shaped virus which has no strict limit on the size
of RNA that can be encapsidated. Alternatively, TMV movement protein can be used in place
of BMV 3a (Figure 7c). Hybrids between tobamoviruses and bromoviruses were shown to be
viable (Sacher et al., 1988; De Jong and Ahlquist, 1992).

Other permutations and combinations of genes pretransformed and those included in the 
DNA-launching platform will readily be appreciated by the skilled artisan in light of the 
teachings herein. (See, e.g., Figures 8c, 10b, and 10c).

As indicated above, the CCMV subgenomic promoter can be substituted for BMV
sequences in a desired DNA-launching platform. Because the sequence of the CCMV subgenomic
promoter differs from the sequence of the BMV subgenomic promoter, the probability of
recombination that would result in loss of a foreign gene is much lower in a construct having a
combination of these two different promoters.

In the above examples, trans-acting components may include, but are not limited to, 
replication factors, components responsible for cell to cell movement, or components such as 
the coat protein which may be required for long distance spread, viral proteases responsible for 
post translational processing, or other known trans-acting functions. 

Example 10 - Transfection of N. tabacum Protoplasts with GUS Containing DNA-Launching
Platforms

N. tabacum protoplasts isolated using the above described methods were inoculated by
electroporation with DNA-launching platforms for BMV RNA3 derivatives in the presence or
absence of 1a and 2a expression plasmids. BMV RNA3 derivatives contained the GUS gene in
place of the coat protein ORF (Figure 5a) (these were inoculated with or without coat protein
expression plasmid, Figure 5b), or had the BMV CP gene translated from an additional
subgenomic RNA driven from a BMV or CCMV subgenomic promoter (Figures 5c and 5d), or
had the SHMV coat protein translated from an additional BMV subgenomic RNA (Figure 7b).
Protoplasts were collected by centrifugation (800 rpm, 5 min.) 24 hours post inoculation. The
chemiluminescent GUS assay was performed using GUS-Light™ (Tropix, MA) according to
the manufacturer's instructions. Protein concentrations were determined using the Bio-Rad
protein kit (Bio-Rad Laboratories, Hercules, CA). The GUS values, determined by luminometer,
were adjusted to the same total protein concentration. Figures 16a and 16b show successful GUS
expression in protoplasts in the presence of trans-acting BMV replication factors 1a and 2a.
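
Adjusting luminometer readings to the same total protein concentration amounts to dividing
each reading by the protein concentration of its sample. The Python sketch below illustrates
one way this could be done. The sample names and numbers are hypothetical placeholders, not
data from this Example, and the function name `normalize_gus` is introduced here.

```python
def normalize_gus(readings):
    """Normalize luminometer readings to protein concentration.

    `readings` is an iterable of (sample, light_units, protein_mg_per_ml)
    tuples; the result maps each sample to light units per mg/ml protein.
    """
    normalized = {}
    for sample, light_units, protein in readings:
        if protein <= 0:
            raise ValueError(f"protein concentration must be positive: {sample}")
        normalized[sample] = light_units / protein
    return normalized


# Hypothetical readings for illustration only.
example = [
    ("launching platform + 1a/2a plasmids", 1000.0, 0.5),
    ("launching platform alone", 300.0, 1.5),
]
for sample, value in normalize_gus(example).items():
    print(sample, value)
```

Normalizing in this way keeps differences in protoplast recovery between samples from being
read as differences in GUS expression.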

Example 11 - Transfection of N. tabacum Protoplasts with GFP Containing DNA-Launching
Platform

N. tabacum protoplasts isolated by using the above described methods were transfected
by electroporation with expression plasmids for trans-acting BMV replication factors 1a and 2a
and with DNA-launching platforms for RNA3 derivatives having the GFP gene in place of the BMV
coat protein ORF (Figure 6e), the CP gene translated from an additional subgenomic RNA
(Figure 6a) or with an RNA transcript having the GFP expressed as a fusion protein with the BMV
3a ORF (Figure 6d). Protoplasts were incubated for 24 hrs and examined for GFP expression
using a fluorescent microscope. Figure 18 shows the successful expression of GFP in
protoplasts.

Example 12 - Transfection of (1a + 2a)-Transgenic Plants with BMV RNA3-Based DNA-
Launching Platform Containing GFP

N. benthamiana plants were transfected using particle bombardment as described
above with a DNA-launching platform for BMV RNA3 having the GFP gene in place of the BMV
coat protein (Figure 6e). The GFP expression was determined 24 hrs post inoculation using a
fluorescent microscope. Figure 17 shows the successful expression of GFP in (1a + 2a)-
transgenic N. benthamiana.

Example 13 - Transfection of (1a + 2a)-Transgenic N. benthamiana with BMV RNA3 DNA-
Launching Platform Using Agrobacterium

N. benthamiana plants were inoculated with the BMV RNA3 DNA-launching platform
using Agrobacterium tumefaciens. Once the desired construct (pB3LR42) was obtained in E.
coli, it was transferred to A. tumefaciens strain LBA4404 using the freeze-thaw method as
described above. The Agrobacterium was grown overnight at 28°C under constant shaking. A
single lower leaf of N. benthamiana was punctured with a needle multiple times and submerged
in Agrobacterium culture. The plants were grown at 23°C with a 16 hr photoperiod. The
inoculated leaves were harvested 14 days post-inoculation. The total RNA extraction and
northern blot hybridization were performed as described above. Figure 19 shows replication of
launched BMV RNA3 in inoculated (1a + 2a)-transgenic N. benthamiana.

Example 14 - Transfection of (1a + 2a)-Transgenic Plants with BMV RNA3-Based DNA-
Launching Platform Containing GUS and SHMV Coat Protein

N. benthamiana plants were transfected using particle bombardment as described
above with a DNA-launching platform for BMV RNA3 wherein the BMV coat protein was
replaced with the SHMV (Sunn-hemp mosaic virus) coat protein and the GUS gene was inserted
downstream of an additional BMV subgenomic promoter (Figure 7b). The GUS expression was
determined by the histochemical GUS assay described above. Figure 20 shows the successful
expression of GUS in (1a + 2a)-transgenic plants.

Example 15 - Movement of Launched BMV RNA3

F1 progeny plants from self-fertilized (1a+2a)-transgenic N. benthamiana BP14 were
inoculated with the BMV RNA3 DNA-launching platform using Agrobacterium tumefaciens.
Seedlings were germinated on Smurf media containing kanamycin. Plants were grown at 23°C
with a 16 hr photoperiod. Once the desired construct (pB3LR42) was obtained in E. coli, it was
transferred to A. tumefaciens strain LBA4404 using the freeze-thaw method as described above.
The Agrobacterium was grown overnight at 28°C under constant shaking. A single lower leaf
of N. benthamiana was punctured with a needle multiple times and submerged in Agrobacterium
culture. The inoculated, middle, and upper leaves were harvested 14 days post-inoculation.
Total RNA extraction and northern blot hybridization were performed as described above.
RNA3 replication was detected in all leaves tested (Fig. 21). This shows that BMV RNA3 is able
to replicate, move cell-to-cell and spread long distance in (1a+2a)-transgenic plants.

Example 16 - Transfection of Progeny From ( 1 a+2aVTransgenic K benthamiana With BMV 

15 RNA3 DNA-UunchiggPl^tforpi 

Progeny plants from self-fertilized (1a+2a)-transgenic N. benthamiana (designated
BP 14) were inoculated with the BMV RNA3 DNA-launching platform using Agrobacterium as
described in Example 13. Control plants (non-transgenic N. benthamiana) were inoculated with
the sap from BMV-infected barley using an inoculation buffer composed of 50 mM NaPO4, pH 7.0,
and 1% celite. Root samples were harvested 6 weeks post-inoculation. RNA extraction and
northern blot hybridization were performed as described above. Figure 22 shows that BMV
RNA3 replicated to very high levels in roots. In some (1a+2a)-transgenic plants (Figure 22,
lanes 2, 5, 6, 7, 8, 10), replication of launched RNA3 dramatically exceeded replication of wild-type
BMV in non-transgenic N. benthamiana plants (Figure 22, lane 1). This shows that this
system can be used for delivery of RNA, proteins, peptides, or other compounds to roots and
enables testing of such compounds for various activities, for example, activities directed against
root parasites. For example, proteins with anti-nematode activities can be inserted into the RNA3
DNA-launching platform using the above-described strategies and expressed in roots upon
RNA3 replication. Such proteins can be engineered to be expressed in the cytoplasm or,
alternatively, secreted into the surrounding soil.

Example 17 - Barley Stripe Mosaic Virus

Barley stripe mosaic virus (BSMV) has a tripartite genome (RNA alpha, beta, and
gamma). These genomic RNAs have an m7Gppp cap at the 5' end and a tRNA-like structure
at the 3' end (Jackson and Hunter, 1989).

DNA-launching plasmids for BSMV RNA alpha, RNA beta, and RNA gamma are
constructed by precisely fusing each BSMV cDNA at its 5' end to a DNA-dependent
RNA polymerase promoter and at its 3' end to a self-cleaving ribozyme. A
polyadenylation signal may also be included. Alternatively, a convenient restriction site may
be engineered at the 3' end of the viral cDNAs. Foreign genes or sequences may be expressed in
several ways. For example, DNA-launching plasmids based on BSMV RNA beta may contain
a foreign gene or sequence expressed in place of ORF beta a.
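
The 5'-to-3' element order described above (promoter, viral cDNA, self-cleaving ribozyme, optional polyadenylation signal) can be illustrated with a short sketch. The Python fragment below is a bookkeeping aid only: the `Element` type, the function names, and the six-letter placeholder sequences are hypothetical and do not correspond to any actual BSMV construct or to sequences disclosed herein.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass(frozen=True)
class Element:
    """One named segment of a cassette; sequences here are placeholders."""
    name: str
    sequence: str


def assemble_cassette(promoter: Element, viral_cdna: Element,
                      ribozyme: Element,
                      polya_signal: Optional[Element] = None) -> List[Element]:
    """Return the cassette elements in 5'-to-3' order.

    The promoter sits directly before the viral cDNA and the ribozyme
    directly after it, with no spacer, mirroring the "precisely fused"
    arrangement described in the text.
    """
    elements = [promoter, viral_cdna, ribozyme]
    if polya_signal is not None:
        elements.append(polya_signal)
    return elements


def cassette_sequence(elements: List[Element]) -> str:
    """Concatenate element sequences without intervening nucleotides."""
    return "".join(e.sequence for e in elements)


# Hypothetical placeholder sequences, for illustration only.
cassette = assemble_cassette(
    Element("promoter", "TTGACA"),
    Element("viral_cDNA", "GTAAAC"),
    Element("ribozyme", "GGCCGG"),
    Element("polyA_signal", "AATAAA"),
)
print([e.name for e in cassette])
print(cassette_sequence(cassette))
```

Omitting the `polya_signal` argument yields the three-element arrangement, which corresponds to the variant in which no polyadenylation signal is included.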

Transgenic plants having one or more trans-acting factors fused to the DNA-dependent
RNA polymerase promoter and terminator are obtained. Such trans-acting factors may include
parts of the viral RNA replicase (ORFs alpha a and/or gamma a) or other trans-acting factors.
The trans-acting factors are stably expressed in the plant cell, or their expression may be induced
if an inducible promoter is used. Cis-acting sequences necessary for BSMV RNA replication
are removed from the transgenes. Alternatively, the full-length RNA alpha is expressed from the
chromosome. Alternatively, ORF gamma a, including the 5' untranslated region, and ORF
gamma b from a seed-transmitted strain, such as ND18, are also expressed (Edwards, 1995).

A DNA-launching plasmid is constructed containing the DNA-dependent RNA
polymerase promoter precisely fused to the 5' end of the BSMV RNA beta; cis-acting elements
important for the BSMV RNA beta life cycle, such as the 5' and 3' ends and the intercistronic region
between the beta a and beta b ORFs (Zhou and Jackson, 1996); and a foreign gene or sequence
in place of ORF beta a (coat protein), which is dispensable for BSMV replication and movement
(Petty and Jackson, 1990). Such DNA-launching plasmids may lack the internal poly(A) region,
as this region is dispensable for replication, and contain a ribozyme or a convenient restriction
site at the 3' end of the modified viral RNA. Alternatively, a DNA-launching plasmid is
constructed from RNA gamma in which ORFs gamma a and/or gamma b are replaced with
foreign genes or sequences, which may also include the triple gene block genes (ORFs beta b,
beta c, and beta d) or a heterologous movement protein (TMV 30K, RCNMV 35K).



Example 18 - Tobacco Mosaic Virus

Tobacco mosaic virus (TMV) has a single-stranded, positive sense RNA genome. The
5' end has an m7Gppp cap and the 3' end contains a tRNA-like structure.

A DNA-launching plasmid is constructed based on TMV RNA containing TMV cDNA
precisely fused at its 5' end to a DNA-dependent RNA polymerase promoter and at its 3' end
to a self-cleaving ribozyme. A polyadenylation signal may also be included. Alternatively, a
convenient restriction site may be engineered at the 3' end. A foreign gene may be expressed from
an additional subgenomic RNA by including an additional subgenomic RNA promoter on the
(-) strand.

Transgenic plants are obtained having one or more trans-acting factors fused to the
DNA-dependent RNA polymerase promoter and terminator. Such factors may include the viral
replicase (126K/183K), movement protein (30K), or coat protein (17.6K). At least one
cis-acting sequence necessary for TMV RNA replication is removed from the transgenes. The
trans-acting factors are stably expressed in the plant cell, or their expression may be induced if an
inducible promoter is used.

A DNA-launching plasmid is constructed containing the DNA-dependent RNA
polymerase promoter precisely fused to the 5' end of the TMV cDNA; cis-acting elements
important for the TMV life cycle, such as the 5' and 3' ends, origin of assembly, etc.; at least one
foreign gene or sequence in place of the trans-acting factor that is expressed from the
chromosome; and a ribozyme or a convenient restriction site at the 3' end. Alternatively, the
foreign gene sequence can be expressed from an additional subgenomic RNA promoter, and the
sequence coding for the trans-acting factor that is expressed from the transgene can be deleted
from the DNA-launching plasmid. Preferably, if the viral replicase proteins are expressed in
transgenic plants, the DNA-launching plasmid will have a deletion of nucleotides 3420-4902,
which appears to be a region that inhibits replication in trans (Lewandowski et al., 1998).

Example 19 - Potato Virus X

Potato virus X (PVX) has a single-stranded, positive sense RNA genome. The 5' end has
an m7Gppp cap and the 3' end is polyadenylated. A full-length cDNA clone of PVX has been
constructed and infectious RNA transcripts obtained (Hemenway et al., 1990).

A DNA-launching plasmid is constructed based on PVX RNA containing PVX cDNA
precisely fused at its 5' end to a DNA-dependent RNA polymerase promoter and having a
polyadenylation site at its 3' end. A convenient restriction site may also be included at the 3'
end. A foreign gene may be expressed from an additional subgenomic RNA.

Transgenic plants are obtained having one or more trans-acting factors fused to the
DNA-dependent RNA polymerase promoter and terminator. Such factors may include the viral
RNA polymerase gene (ORF1-147K), coat protein (ORF5-21K), or triple gene block
(ORF2-25K, ORF3-12K, ORF4-8K). The triple gene block genes can be expressed individually.
Alternatively, they can be expressed as negative sense transcripts from which plus sense
subgenomic RNA for ORFs 2, 3, and 4 can be transcribed by the viral replicase. Such a transgene
will have a DNA-dependent RNA polymerase promoter fused to the sequence of ORFs 2, 3, and 4
in the minus sense orientation, and the transcribed sequence will include a subgenomic RNA
promoter. At least one cis-acting sequence necessary for PVX RNA replication is removed from
the transgenes. The trans-acting factors are stably expressed in the plant cell, or their expression may
be induced if an inducible promoter is used.

A DNA-launching plasmid is constructed containing the DNA-dependent RNA
polymerase promoter precisely fused to the 5' end of the PVX genome; cis-acting elements
important for the PVX life cycle, such as the 5' and 3' ends, origin of assembly, etc.; at least one
foreign gene or sequence in place of the trans-acting factor that is expressed from the
chromosome; and a polyadenylation signal. Alternatively, the foreign gene sequence can be
expressed from an additional subgenomic RNA promoter, and the sequence coding for the
trans-acting factor that is expressed transgenically can be deleted from the DNA-launching plasmid.

Alternatively, a DNA-launching plasmid is constructed having a DNA-dependent RNA
polymerase promoter, a polyadenylation site, and the PVX cDNA sequence in which ORF2
(25K) is replaced with a foreign gene or sequence. Alternatively, ORF2 is deleted and the
foreign gene is expressed from an additional subgenomic RNA promoter. Such a
DNA-launching plasmid is inoculated onto transgenic plants expressing a movement protein from
a heterologous virus, such as tobacco mosaic virus (TMV 30K), tomato mosaic virus (ToMV
30K), or red clover necrotic mosaic virus (RCNMV 35K).

Example 20 - Flock House Virus

Flock house virus (FHV) has a genome consisting of two single-stranded RNAs. RNA1
encodes protein A, which is involved in RNA replication, and protein B, which is translated from
subgenomic (sg) RNA3 and is dispensable for RNA replication. RNA2 encodes the virion capsid
precursor protein alpha. FHV is infectious to insect, plant, mammalian, and yeast cells
(Selling et al., 1990; Price et al., 1996).

A DNA-launching plasmid is constructed for FHV RNA1 and RNA2 containing FHV
RNA cDNA precisely fused at its 5' end to a DNA-dependent RNA polymerase promoter and
at its 3' end to a self-cleaving ribozyme. A polyadenylation signal may also be included.
Alternatively, a convenient restriction site may be engineered at the 3' end. Foreign genes or
sequences may be expressed in several ways. For example, DNA-launching plasmids based on
FHV RNA1 may contain a foreign gene or sequence expressed from subgenomic RNA3 as an ORF
B replacement or as a translational fusion with ORF B. Alternatively, a foreign gene may be
expressed from an additional sg RNA. DNA-launching plasmids based on FHV RNA2 may
contain a foreign gene(s) or sequence(s) expressed as a part of polyprotein alpha. Foreign
gene(s) in such a construct may include sequences necessary for polyprotein cleavage.
DNA-launching plasmids will preferably also express a movement protein of a heterologous plant
virus, such as 30K of TMV or 35K of RCNMV. Alternatively, DNA-launching plasmids will
be inoculated onto transgenic plants expressing such a movement protein.

Transgenic plants are obtained having one or more trans-acting factors fused to the
DNA-dependent RNA polymerase promoter and terminator. Such factors may include protein
A or capsid protein precursor alpha, and preferably will also include a movement protein from
a plant virus, such as 30K of TMV or 35K of RCNMV. Trans-acting factors are stably
expressed in the plant cell, or their expression may be induced if an inducible promoter is used.
Transgenes expressing the trans-acting factors preferably lack at least one cis-acting sequence that
is necessary for their replication, such as the 5' and/or 3' end.

A DNA-launching plasmid is constructed based on FHV RNA1 or FHV RNA2
containing a DNA-dependent RNA polymerase promoter precisely fused to the 5' end of RNA1
(or RNA2); cis-acting elements important for FHV RNA1 (or RNA2) replication, such as the 5'
and 3' ends; at least one foreign gene or sequence; and a self-cleaving ribozyme at the 3' end.
A polyadenylation signal may also be included. Alternatively, a convenient restriction site may
be engineered at the 3' end of the modified viral RNA sequence of the DNA-launching plasmid.
DNA-launching plasmids based on FHV RNA1 may contain a foreign gene or sequence in place
of ORF A. Alternatively, ORF A may be deleted and the foreign gene may be expressed
from subgenomic RNA3, for example as an ORF B replacement or as a translational fusion with
ORF B. Alternatively, the DNA-launching plasmid may contain two exogenous RNA sequences,
one in the place of ORF A and the other expressed from subgenomic RNA3.
DNA-launching plasmids based on FHV RNA2 may contain a foreign gene(s) or sequence(s) in place
of ORF alpha or expressed as a part of polyprotein alpha. Foreign gene(s) in such a construct
may include sequences necessary for polyprotein cleavage.

Example 21 - Tomato Spotted Wilt Virus

Tomato spotted wilt virus (TSWV) is a tripartite (RNA L, M, S), negative sense and
ambisense, single-stranded RNA virus.

Transgenic plants are obtained having one or more trans-acting factors fused to the
DNA-dependent RNA polymerase promoter and terminator. Such factors include the putative
TSWV polymerase gene (ORF L), ORF N, and possibly other trans-acting factors (NSm or NSs).
At least one cis-acting sequence necessary for TSWV RNA replication, such as the 5' and/or 3'
end, is removed from the transgene. Trans-acting factors are stably expressed in the
plant cell, or their expression may be induced if an inducible promoter is used.

A DNA-launching plasmid is constructed based on TSWV RNA M in which the G1 and
G2 coding sequences are replaced with at least one foreign gene or sequence. Such a
DNA-launching plasmid contains a DNA-dependent RNA polymerase promoter and TSWV RNA M
cDNA fused to self-cleaving ribozymes at the 5' and 3' ends. Alternatively, a
DNA-launching plasmid is constructed based on TSWV RNA S in which the N coding region is
replaced with a foreign gene or sequence.

Example 22 - Barley Mild Mosaic Virus

The genome of barley mild mosaic virus (BaMMV) consists of two positive sense,
single-stranded, 3'-polyadenylated RNAs. RNA1 encodes proteins related to the potyviral P3,
6K1, CI, 6K2, NIa-VPg, NIa-Pro, NIb, and capsid protein (Kashiwazaki et al., 1990). RNA2
encodes the P1 and P2 proteins (Kashiwazaki et al., 1991). The P1 protein is related to the potyviral
HC-Pro, and the P2 protein is important for fungal transmission. An isolate was obtained
containing a deletion in the P2 protein (Timpe and Kuhne, 1995), thus indicating that P2 is
dispensable for viral RNA replication.

A DNA-launching plasmid is constructed for BaMMV RNA1 and RNA2 containing
BaMMV RNA cDNA precisely fused at its 5' end to a DNA-dependent RNA polymerase
promoter and having a polyadenylation site at its 3' end. Foreign genes or sequences may be expressed
in several ways. For example, DNA-launching plasmids based on BaMMV RNA2 may contain
a foreign gene or sequence expressed as a part of a polyprotein that can be cleaved to release
the foreign protein.

Transgenic plants are obtained having the BaMMV RNA1 cDNA lacking the 5' and 3' 
ends fused to the DNA-dependent RNA polymerase promoter and terminator. 

A DNA-launching plasmid is constructed based on BaMMV (isolate M) RNA2. Such a
plasmid contains a DNA-dependent RNA polymerase promoter precisely fused to the 5' end of
RNA2, the RNA2 cis-acting replication signals located in the 5' and 3' ends, the P1 ORF, and a foreign
gene either in place of the P2 ORF or expressed as a part of a P1/P2 polyprotein that can be cleaved
to release the foreign protein.

The contents of all references cited throughout are incorporated herein by this reference
to the extent they are not inconsistent with the disclosure, teachings, and principles of the subject 
invention. 

It should be understood that the examples and embodiments described herein are for 
illustrative purposes only and that various modifications or changes in light thereof will be 
suggested to persons skilled in the art and are to be included within the spirit and purview of this 
application. 

Claims



1. A DNA-launching platform comprising:
a) a polynucleotide molecule encoding a modified viral RNA molecule; and
b) a DNA dependent RNA polymerase promoter.

2. The DNA-launching platform of claim 1 further comprising a sequence encoding at
least one cis-acting element.

3. The DNA-launching platform of claim 1 further comprising a ribozyme sequence.

4. The DNA-launching platform of claim 1 further comprising a termination sequence.

5. The DNA-launching platform of claim 1 further comprising a restriction site.

6. The DNA-launching platform of claim 1 wherein said modified RNA molecule
comprises an exogenous RNA segment.

7. The DNA-launching platform of claim 1 wherein said DNA dependent RNA
polymerase promoter is capable of functioning in a plant cell.

8. A method of genotypically or phenotypically modifying one or more cells comprising
the following steps:
a) obtaining a DNA-launching platform comprising a polynucleotide molecule encoding
a modified viral RNA; and
b) transfecting said one or more cells with said DNA-launching platform, wherein said
polynucleotide molecule is transcribed thereby forming a replicatable RNA transcript.

9. The method of claim 8 further comprising pre-transforming said cell with at least one
polynucleotide molecule encoding at least one trans-acting factor.

10. The method of claim 8 further comprising introducing a trans-acting factor.

11. The method of claim 10 wherein said introducing a trans-acting factor comprises
co-transfection of an expression plasmid comprising a nucleotide sequence encoding said
trans-acting factor.

12. The method of claim 10 wherein said introducing a trans-acting factor comprises
co-transfection of an RNA transcript encoding said trans-acting factor.

13. The method of claim 10 wherein said trans-acting factor is stably expressed.

14. The method of claim 8 wherein said modified viral RNA comprises an exogenous
RNA segment.

15. The method of claim 8 wherein said DNA-launching platform comprises a ribozyme
sequence.

16. The method of claim 8 wherein said DNA-launching platform comprises a
promoter.

17. The method of claim 8 wherein said DNA-launching platform comprises a
termination sequence.

18. The method of claim 8 wherein said DNA-launching platform comprises a
restriction site.

19. The modified cell produced by the method of claim 8.

20. A method of producing a plant or plant tissue comprising at least one genotypically
or phenotypically modified cell, said method comprising transfecting cells of said plant or plant
tissue with a DNA-launching platform, wherein said DNA-launching platform comprises a
polynucleotide encoding a modified RNA molecule, such that said polynucleotide molecule is
transcribed to form a replicatable RNA transcript.

21. The method of claim 20 wherein said modified RNA molecule comprises an
exogenous RNA segment.

22. The method of claim 20 wherein said DNA-launching platform comprises a
ribozyme sequence.

23. The method of claim 20 wherein said DNA-launching platform comprises a
promoter.

24. The method of claim 20 wherein said DNA-launching platform comprises a
termination sequence.

25. The method of claim 20 wherein said DNA-launching platform comprises a
restriction site.

26. A method of producing a genotypically or phenotypically modified plant comprising
obtaining at least one modified cell produced by the method of claim 8; and subjecting said
modified cell to conditions whereby a plant is regenerated therefrom.

27. A plant produced by the method of claim 26.

28. A plant descended from the plant of claim 27.

29. The method of claim 20, wherein said plant or plant tissue comprises one or more
cells transformed with a polynucleotide molecule encoding at least one trans-acting factor,
wherein said polynucleotide molecule is expressed.

30. The method of claim 29, wherein said modified viral RNA molecule is capable of
replication only in said one or more cells transformed with a polynucleotide molecule encoding
at least one trans-acting factor.

[Figure: Vectors for expression of BMV 1a and 2a ORFs in transgenic plants]

SEQUENCE LISTING



<110> WISCONSIN ALUMNI RESEARCH FOUNDATION
Street Address: 614 Walnut Street
City: Madison
State: Wisconsin
Country: US
ZIP: 53705
Phone number: (608) 265-2135 Fax: (608) 263-1064

<120> Improved Methods and Materials for Transformation

<130> WARF-100XC1

<140>
<141>

<150> 60/086,526
<151> 1998-05-22

<160> 8

<170> PatentIn Ver. 2.0

<210> 1
<211> 7074
<212> DNA
<213> Brome mosaic virus

<400> 1

AAACACTGAT AGTTTAAACT GAAGGCGGGA AACGACAATC TGATCATGAG CGGAGAATTA 60
AGGGAGTCAC GTTATGACCC CCGCCGATGA CGCGGGACAA GCCGTTTTAC GTTTGGAACT 120
GACAGAACCG CAACGATTGA AGGAGCCACT CAGCCGCGGG TTTCTGGAGT TTAATGAGCT 180
AAGCACATAC GTCAGAAACC ATTATTGCGC GTTCAAAAGT CGCCTAAGGT CACTATCAGC 240
TAGCAAATAT TTCTTGTCAA AAATGCTCCA CTGACGTTCC ATAAATTCCC CTCGGTATCC 300
AATTAGNNNN NNNNNNNNNN NNNNNNNNNN GATCGTTTCG CATGATTGAA CAAGATGGAT 360
TGCACGCAGG TTCTCCGGCC GCTTGGGTGG AGAGGCTATT CGGCTATGAC TGGGCACAAC 420
AGACAATCGG CTGCTCTGAT GCCGCCGTGT TCCGGCTGTC AGCGCAGGGG CGCCCGGTTC 480
TTTTTGTCAA GACCGACCTG TCCGGTGCCC TGAATGAACT GCAGGACGAG GCAGCGCGGC 540
TATCGTGGCT GGCCACGACG GGCGTTCCTT GCGCAGCTGT GCTCGACGTT GTCACTGAAG 600
CGGGAAGGGA CTGGCTGCTA TTGGGCGAAG TGCCGGGGCA GGATCTCCTG TCATCTCACC 660
TTGCTCCTGC CGAGAAAGTA TCCATCATGG CTGATGCAAT GCGGCGGCTG CATACGCTTG 720
ATCCGGCTAC CTGCCCATTC GACCACCAAG CGAAACATCG CATCGAGCGA GCACGTACTC 780
GGATGGAAGC CGGTCTTGTC GATCAGGATG ATCTGGACGA AGAGCATCAG GGGCTCGCGC 840
CAGCCGAACT GTTCGCCAGG CTCAAGGCGC GCATGCCCGA CGGCGATGAT CTCGTCGTGA 900
CCCATGGCGA TGCCTGCTTG CCGAATATCA TGGTGGAAAA TGGCCGCTTT TCTGGATTCA 960
TCGACTGTGG CCGGCTGGGT GTGGCGGACC GCTATCAGGA CATAGCGTTG GCTACCCGTG 1020
ATATTGCTGA AGAGCTTGGC GGGGAATGGG CTGACCGCTT CCTCGTGCTT TAGGGTATCG 1080
CCGCTCCCGA TTCGCAGCGC ATCGCCTTCT ATCGCCTTCT TGACGAGTTC TTCTGANNNN 1140
NNNNNNNNNN NNNNNNNNNN GATCGTTCAA ACATTTGGCA ATAAAGTTTC TTAAGATTGA 1200
ATCCTGTTGC CGGTCTTGCG ATGATTATCA TATAATTTCT GTTGAATTAC GTTAAGCATG 1260
TAATAATTAA CATGTAATGC ATGACGTTAT TTATGAGATG GGTTTTTATG ATTAGAGTCC 1320
CGCAATTATA CATTTAATAC GCGATAGAAA ACAAAATATA GCGCGCAAAC TAGGATAAAT 1380
TATCGCGCGC GGTGTCATCT ATGTTACTAG ATCGGGCCTC CTGTCAATGC TGGCGGCGGC 1440
TCTGGTGGTG GTTCTGGTGG CGGCTCTGAG GGTGGTGGCT CTGAGGGTGG CGGTTCTGAG 1500
GGTGGCGGCT CTGAGGGAGG CGGTTCCGGT GGTGGCTCTG GTTCCGGTGA TTTTGATTAT 1560
GAAAAGATGG CAAACGCTAA TAAGGGGGCT ATGACCGAAA ATGCCGATGA AAACGCGCTA 1620
CAGTCTGACG CTAAAGGCAA ACTTGATTCT GTCGCTACTG ATTACGGTGC TGCTATCGAT 1680
GGTTTCATTG GTGACGTTTC CGGCCTTGCT AATGGTAATG GTGCTACTGG TGATTTTGCT 1740
GGCTCTAATT CCCAAATGGC TCAAGTCGGT GACGGTGATA ATTCACCTTT AATGAATAAT 1800
TTCCGTCAAT ATTTACCTTC CCTCCCTCAA TCGGTTGAAT GTCGCCCTTT TGTCTTTGGC 1860
CCAATACGCA AACCGCCTCT CCCCGCGCGT TGGCCGATTC ATTAATGCAG CTGGCACGAC 1920
AGGTTTCCCG ACTGGAAAGC GGGCAGTGAG CGCAACGCAA TTAATGTGAG TTAGCTCACT 1980
CATTAGGCAC CCCAGGCTTT ACACTTTATG CTTCCGGCTC GTATGTTGTG TGGAATTGTG 2040
AGCGGATAAC AATTTCACAC AGGAAACAGC TATGACCATG ATTACGCCAA GCTTGCATGC 2100
CTGCAGGTCG ACTCTAGAGG ATCCCCGGTC ACTGGATTTT GGTTTTAGGA ATTAGAAATT 2160
TTATTGATAG AAGTATTTTA CAAATACAAA TACATACTAA GGGTTTCTTA TATGCTCAAC 2220
ACATGAGCGA AACCCTATAA GAACCCTAAT TCCCTTATCT GGGAACTACT CACACATTAT 2280
TCTGGAGAAA ATAGAGAGAG ATAGATTTGT AGAGAGAGAC TGGTGATTTG CGGACTCTAG 2340
AGGATCCCCG GGTACCGAGC TCGAATTCTC GAGCAGAGGT CTCACACAGA GACAAGCGCA 2400
TCACTTAACA CAATTAAAGA TCAAATCACC AGCGAGCTCG CCGTTAAAGC AATACTCAAA 2460
GGACTTCTTG TGTCGTGTTA AGGCAACCAA ACAGTACTCC TCATGTTTAA ACAAATCACA 2520
TTTGGTCGAC TTAAGCCGAA CCAAAGTGAC GTTGTCAACA GAGATCCCTT GCGCTTCGTG 2580
TACTGTTTTT ATGTGTCCAT CAATCCAGTC CTTGCTCACG GGAAAATCCT TAGCCCTCGT 2640
TTGAAGGGCC GCTTTATCAG CTTGAGTCAT CGTAAGATAC GTTCTGTTCG GATCAATAGT 2700
GACCTGCAAA CCAGAAGTAA TACGACGCTT CGTGAGACTT CTAGAAACTT TGGACTCAGA 2760
TGTCCAGGAT TGATACTTCG TGTCCCTATT ACCGCATTTA CGCTTCAGCA GATTAACAGC 2820
AGCGATAACA TCTTGCGGAC ACCGGTAAGT CTTGTGAACA ACGTCACGGC GATCATATTG 2880
CAGATTACCG TGGAGCAATT TAAAACCCGC GTCACGAGAC TTGAACGAAA TCTGCTCTGT 2940
GTCCCCAAAG GCAAGAACTT GTGAACATTT AGACAGAGCA GCCACCACCA GGAGTTGACC 3000
ATAATGTAGT AAACCAGCCT CATCAACAAG CAGCCTATGA CAGGACGGTA CACCGTGCAT 3060
GATCGCAGAA TCCGCGGTGC GCACAACGTC CAAAGCTACC TTGGAATTAT AAGTGTCAGG 3120
GAATAAAGCC ATCCTGACGT CCTCGGCCGA TTTACGATTC GCCGTCACAA TTAGGTCCTC 3180
TCCCATACGG AATGCATCTT TTATGGCAGT GGTTTTACCG CATCCCGCAA CTCCATCAAC 3240
CATGGAAATA TCGCATGTAG GGACAGAAAC TTTGGCGCTA GCTTCTGCAA TGTCCCTCAA 3300
GTTAGAGCAT GCACATGTTT TATCAACAAT GTACGTTTCA TCTGCGTGCT TCGGACCTAA 3360
ACCATGCTCA TTATATCCAA CAGTGTAATC GTATTTTTTA GGATACAACC AGTTACCGTT 3420
GGCCAAATGG ACATTCACCA TATCGTCTAT GCGATGGTAG GTCTCAAAGA TGCTCTTATT 3480
TGCGATCTCA CTTCCGCGAC CGCCGGAAAT GTCCCATAGG TGACGAAGAT TAGACTCGGA 3540
GTTGTTATGT AATCTCTTAC AATAACGCAC AAATTCCTTC ATGGCTCCGT GTCTAGATAT 3600
GCCACGAGGG TCCGTTGGTA CCTCAACAGA CACCTCGGCA TCCGGGACCA CATCAGTCAC 3660
CGGTTTAACG TCATCACTGA CGGACTCAGG GCTCGAACTC TCAGGGGCAT CATGAAACTC 3720
CTCCTGAGGT ATCTCAGCAG CTGGCGGGAC TTTCGCCTTC TTCTTCGAGC GCTTGGTCTT 3780
GGCTGTCTGC ACTTCATGCT CCAGCCGGTC GAATAAGTCC TCTTCAGTCC AAAACGTTCT 3840
CAAACGTGAT ATCGGTACAG AATCTTGCTC AAATTCTTCA ACGTTTGAGA GACGAGTCAG 3900
AAACTTAAAA CTGTCCGCAT AAGAATCCAG ACGTAGTAGG GGAAATCTGC TAGCCAATGT 3960
TCTCAGCCAT CCTACTTTCG CCCTGGATGA ATCTCCACCC CACCAAAACC TAGTTTTGAA 4020
GTGATGGCAC CAACCTTTCC ATTCCATCCC ATCGCGGAGG GCCGTAAGCT TTTCGTACTT 4080
TTGATACAGA TTCAAAGTCA AAGCAAAGGC CACTAGATGA TAATCTTCAA TGTCTAAGCG 4140
CTCACCAGCC ATGATAGCCT GACCGTTAAT AATAACAGTC GACGACTTGG CGGATAAGAT 4200
AGATGCGACA GCTTTCATGT TCTCAGTCCA TTCTTTACTT TCCTTGAAAC ATCTGAAAGC 4260
TATCTCCTCT ACCTCTCTCA CTGTGGTTTT GGCGACGCGC ACACATTTCC AGCGATTGAG 4320
ACTCCAGTCT TCAGGTATTG AGACCCCTAC GTACTTAGAT ATGTCTTCAA ACCATACACA 4380
GTGACGTAGT GTCTCCCGGG GGCAGCGTAA ATTTGTAGCG ATGATCTTAT AGGTCATGAT 4440
GTTACATTTC AGCATTTCGC GCTCCAACAG ATAGGTGGTT CCATCGATGC AATGCACCGA 4500
CTCGGTGAAA AATGAGCCCA AATCTTGCCA TCCGTGGATG TAAGATAATG TGCTTTCATT 4560
TTCAAAATCG AATTTGATCA CCTCATCCGC GCCTGACCCG TCACGTTGCC AGTGACATTT 4620
AAGCAAGGGA AGAAAACCCT CGCGGTCAAA CAACATGGCG CCGTCGAACA TAACGGTACC 4680
ACGTAGTACG CGTACTCCAT GCGAATGCAT GGCGTCACAC AGACCTTGGA AGCCCATATC 4740
ATAACCGCCG TGGATACAGA TAGCCCAATC AGCTTGGACA TCACAATCTT GAGCTCGGTT 4800
AAGACAAAAG TTCGGGACTT CATCGAAATC ATCGCTTTCT TGCAAAATTT TTCGCATGCG 4860
GCACATCCTC TCCTCATGTC GGGCAGCGTC TCTAACACCC AACACAGGAC AACAACTGTG 4920
CACCCTTTTA TCCCTTCTTG AAAAGTGATG CCACCAAGAC CCTCCGAAAT CTATAACGGG 4980
GTCTTCAGGG GGAAAACTGT CGAGACAGTC ATAATGCTCC GCTACACGCA GAGCACCAGC 5040
CAGGCTATGG GGCGCATGAT ACTGCTGAGT CAAATTTAAG TCAAAGGCAC CACCATAACG 5100
GTCACGGAAG GCGTCAGCCT CCTCAATAGA GAGCTTATTG CGAACGTTGA TTTTCTTAGA 5160
CCTTTTCGCG TATTCAATCT GCGCAGATAA CTGTTGCGCA ACCTGATTGT CTACGATGTC 5220
TTGGGCACTC TGGCTGTCAG CACCCTTCTC AGCAATCAAC TTCAGCAAAT CGATAGAACT 5280
TGACATTTTG TTGGTGAAAA ACAAAGAACA AGTAGCAGAA CCGTGGTCGA GGTCCTCTCC 5340
AAATGAAATG AACTTCCTTA TATAGAGGAA GGGTCTTGCG AAGGATAGTG GGATTGTGCG 5400
TCATCCCTTA CGTCAGTGGA GATATCACAT CAATCCACTT GCTTTGAAGA CGTGGTTGGA 5460
ACGTCTTCTT TTTCCACGAT GTTCCTCGTG GGTGGGGGTC CATCTTTGGG ACCACTGTCG 5520
GTAGAGGCAT TCTTGAACGA TAGCCTTTCC TTTATCGCAA TGATGGCATT TGTAGAAGCC 5580
ATCTTCCTTT TCTACTGTCC TTTCGATGAA GTGACAGATA GCTGGGCAAT GGAATCCGAG 5640
GAGGTTTCCC GATATTACCC TTTGTTGAAA AGTCTCAATA GCCCTCTGGT CTTCTGAGAC 5700
TGTATCTTTG ATATTCTTGG AGTAGACGAG AGTGTCGTGC TCCACCATGT TGACCGGGTG 5760
GTCAGTCCCT TATGTTACGT CCTGTAGAAA CCCCAACCCG TGAAATCAAA AAACTCGACG 5820
GCCTGTGGGC ATTCAGTCTG GATCGCGAAA ACTGTGGAAT TGATCAGCGT TGGTGGGAAA 5880
GCGCGTTACA AGAAAGCCGG GCAATTGCTG TGCCAGGCAG TTTTAACGAT CAGTTCGCCG 5940
ATGCAGATAT TCGTAATTAT GCGGGCAACG TCTGGTATCA GCGCGAAGTC TTTATACCGA 6000
AAGGTTGGGC AGGCCAGCGT ATCGTGCTGC GTTTCGATGC GGTCACTCAT TACGGCAAAG 6060
TGTGGGTCAA TAATCAGGAA GTGATGGAGC ATCAGGGCGG CTATACGCCA TTTGAAGCCG 6120
ATGTCACGCC GTATGTTATT GCCGGGAAAA GTGTACAATT CACTGGCCGT CGTTTTACAA 6180
CGTCGTGACT GGGAAAACCC TGGCGTTACC CAACTTAATC GCCTTGCAGC ACATCCCCCT 6240
TTCGCCAGCT GGCGTAATAG CGAAGAGGCC CGCACCGATC GCCCTTCCCA ACAGTTGCGC 6300
AGCCTGAATG GCGAATGNNN NNNNAATTCA GTACATTAAA AACGTCCGCA ATGTGTTATT 6360
AAGTTGTCTA AGCGTCAATT TGTTTACACC ACAATATATC CTGCCACCAG CCAGCCAACA 6420
GCTCCCCGAC CGGCAGCTCG GCACAAAATC ACCACTCGAT ACAGGCAGCC CATCAGNNNN 6480
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 6540
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 6600
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 6660
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 6720
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 6780
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 6840
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 6900
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 6960
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 7020
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNN 7074



<210> 2
<211> 6750
<212> DNA
<213> Brome mosaic virus

<400> 2

AAACACTGAT AGTTTAAACT GAAGGCGGGA AACGACAATC TGATCATGAG CGGAGAATTA 60
AGGGAGTCAC GTTATGACCC CCGCCGATGA CGCGGGACAA GCCGTTTTAC GTTTGGAACT 120
GACAGAACCG CAACGATTGA AGGAGCCACT CAGCCGCGGG TTTCTGGAGT TTAATGAGCT 180
AAGCACATAC GTCAGAAACC ATTATTGCGC GTTCAAAAGT CGCCTAAGGT CACTATCAGC 240
TAGCAAATAT TTCTTGTCAA AAATGCTCCA CTGACGTTCC ATAAATTCCC CTCGGTATCC 300
AATTAGNNNN NNNNNNNNNN NNNNNNNNNN GATCGTTTCG CATGATTGAA CAAGATGGAT 360
TGCACGCAGG TTCTCCGGCC GCTTGGGTGG AGAGGCTATT CGGCTATGAC TGGGCACAAC 420
AGACAATCGG CTGCTCTGAT GCCGCCGTGT TCCGGCTGTC AGCGCAGGGG CGCCCGGTTC 480
TTTTTGTCAA GACCGACCTG TCCGGTGCCC TGAATGAACT GCAGGACGAG GCAGCGCGGC 540
TATCGTGGCT GGCCACGACG GGCGTTCCTT GCGCAGCTGT GCTCGACGTT GTCACTGAAG 600
CGGGAAGGGA CTGGCTGCTA TTGGGCGAAG TGCCGGGGCA GGATCTCCTG TCATCTCACC 660
TTGCTCCTGC CGAGAAAGTA TCCATCATGG CTGATGCAAT GCGGCGGCTG CATACGCTTG 720
ATCCGGCTAC CTGCCCATTC GACCACCAAG CGAAACATCG CATCGAGCGA GCACGTACTC 780
GGATGGAAGC CGGTCTTGTC GATCAGGATG ATCTGGACGA AGAGCATCAG GGGCTCGCGC 840
CAGCCGAACT GTTCGCCAGG CTCAAGGCGC GCATGCCCGA CGGCGATGAT CTCGTCGTGA 900
CCCATGGCGA TGCCTGCTTG CCGAATATCA TGGTGGAAAA TGGCCGCTTT TCTGGATTCA 960
TCGACTGTGG CCGGCTGGGT GTGGCGGACC GCTATCAGGA CATAGCGTTG GCTACCCGTG 1020
ATATTGCTGA AGAGCTTGGC GGCGAATGGG CTGACCGCTT CCTCGTGCTT TACGGTATCG 1080
CCGCTCCCGA TTCGCAGCGC ATCGCCTTCT ATCGCCTTCT TGACGAGTTC TTCTGANNNN 1140
NNNNNNNNNN NNNNNNNNNN GATCGTTCAA ACATTTGGCA ATAAAGTTTC TTAAGATTGA 1200
ATCCTGTTGC CGGTCTTGCG ATGATTATCA TATAATTTCT GTTGAATTAC GTTAAGCATG 1260
TAATAATTAA CATGTAATGC ATGACGTTAT TTATGAGATG GGTTTTTATG ATTAGAGTCC 1320
CGCAATTATA CATTTAATAC GCGATAGAAA ACAAAATATA GCGCGCAAAC TAGGATAAAT 1380
TATCGCGCGC GGTGTCATCT ATGTTACTAG ATCGGGCCTC CTGTCAATGC TGGCGGCGGC 1440
TCTGGTGGTG GTTCTGGTGG CGGCTCTGAG GGTGGTGGCT CTGAGGGTGG CGGTTCTGAG 1500
GGTGGCGGCT CTGAGGGAGG CGGTTCCGGT GGTGGCTCTG GTTCCGGTGA TTTTGATTAT 1560
GAAAAGATGG CAAACGCTAA TAAGGGGGCT ATGACCGAAA ATGCCGATGA AAACGCGCTA 1620
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CAGTCTGACG CTAAAGGCAA ACTTGATTCT GTCGCTACTG ATTACGGTGC TGCTATCGAT 1680 
335 GGTTTCATTG GTGACGTTTC CGGCCTTGCT AATGGTAATG GTGCTACTGG TGATTTTGCT 1740 
GGCTCTAATT CCCAAATGGC TCAAGTCGGT GACGGTGATA ATTCACCTTT AATGAATAAT 1800 
TTCCGTCAAT ATTTACCTTC CCTCCCTCAA TCGGTTGAAT GTCGCCCTTT TGTCTTTGGC 1860 

340 

CCAATACGCA AACCGCCTCT CCCCGCGCGT TGGCCGATTC ATTAATGCAG CTGGCACGAC 1920 
AGGTTTCCCG ACTGGAAAGC GGGCAGTGAG CGCAACGCAA TTAATGTGAG TTAGCTCACT 1980 
CATTAGGCAC CCCAGGCTTT ACACTTTATG CTTCCGGCTC GTATGTTGTG TGGAATTGTG 2040
AGCGGATAAC AATTTCACAC AGGAAACAGC TATGACCATG ATTACGCCAA GCTTGCATGC 2100 
CTGCAGGTCG ACTCTAGAGG ATCCCCGGTC AACATGGTGG AGCACGACAC TCTCGTCTAC 2160 

350 

TCCAAGAATA TCAAAGATAC AGTCTCAGAA GACCAGAGGG CTATTGAGAC TTTTCAACAA 2220 
AGGGTAATAT CGGGAAACCT CCTCGGATTC CATTGCCCAG CTATCTGTCA CTTCATCGAA 2280 
AGGACAGTAG AAAAGGAAGA TGGCTTCTAC AAATGCCATC ATTGCGATAA AGGAAAGGCT 2340
ATCGTTCAAG AATGCCTCTA CCGACAGTGG TCCCAAAGAT GGACCCCCAC CCACGAGGAA 2400 
CATCGTGGAA AAAGAAGACG TTCCAACCAC GTCTTCAAAG CAAGTGGATT GATGTGATAT 2460 

360 

CTCCACTGAC GTAAGGGATG ACGCACAATC CCACTATCCT TCGCAAGACC CTTCCTCTAT 2520 
ATAAGGAAGT TCATTTCATT TGGAGAGGAC CTCGACCACG GTTCTGCTAC TTGTTCTTTG 2580 
TTTTTCACCA ACAAAATGTC AAGTTCTATC GATTTGCTGA AGTTGATTGC TGAGAAGGGT 2640
GCTGACAGCC AGAGTGCCCA AGACATCGTA GACAATCAGG TTGCGCAACA GTTATCTGCG 2700
CAGATTGAAT ACGCGAAAAG GTCTAAGAAA ATCAACGTTC GCAATAAGCT CTCTATTGAG 2760

370 

GAGGCTGACG CCTTCCGTGA CCGTTATGGT GGTGCCTTTG ACTTAAATTT GACTCAGCAG 2820
TATCATGCGC CCCATAGCCT GGCTGGTGCT CTGCGTGTAG CGGAGCATTA TGACTGTCTC 2880 
GACAGTTTTC CCCCTGAAGA CCCCGTTATA GATTTCGGAG GGTCTTGGTG GCATCACTTT 2940
TCAAGAAGGG ATAAAAGGGT GCACAGTTGT TGTCCTGTGT TGGGTGTTAG AGACGCTGCC 3000
CGACATGAGG AGAGGATGTG CCGCATGCGA AAAATTTTGC AAGAAAGCGA TGATTTCGAT 3060

380 

GAAGTCCCGA ACTTTTGTCT TAACCGAGCT CAAGATTGTG ATGTCCAAGC TGATTGGGCT 3120 
ATCTGTATCC ACGGCGGTTA TGATATGGGC TTCCAAGGTC TGTGTGACGC CATGCATTCG 3180
CATGGAGTAC GCGTACTACG TGGTACCGTT ATGTTCGACG GCGCCATGTT GTTTGACCGC 3240
GAGGGTTTTC TTCCCTTGCT TAAATGTCAC TGGCAACGTG ACGGGTCAGG CGCGGATGAG 3300
GTGATCAAAT TCGATTTTGA AAATGAAAGC ACATTATCTT ACATCCACGG ATGGCAAGAT 3360
TTGGGCTCAT TTTTCACCGA GTCGGTGCAT TGCATCGATG GAACCACCTA TCTGTTGGAG 3420
CGCGAAATGC TGAAATGTAA CATCATGACC TATAAGATCA TCGCTACAAA TTTACGCTGC 3480
CCCCGGGAGA CACTACGTCA CTGTGTATGG TTTGAAGACA TATCTAAGTA CGTAGGGGTC 3540
TCAATACCTG AAGACTGGAG TCTCAATCGC TGGAAATGTG TGCGCGTCGC CAAAACCACA 3600
GTGAGAGAGG TAGAGGAGAT AGCTTTCAGA TGTTTCAAGG AAAGTAAAGA ATGGACTGAG 3660
AACATGAAAG CTGTCGCATC TATCTTATCC GCCAAGTCGT CGACTGTTAT TATTAACGGT 3720
CAGGCTATCA TGGCTGGTGA GCGCTTAGAC ATTGAAGATT ATCATCTAGT GGCCTTTGCT 3780
TTGACTTTGA ATCTGTATCA AAAGTACGAA AAGCTTACGG CCCTCCGCGA TGGGATGGAA 3840
TGGAAAGGTT GGTGCCATCA CTTCAAAACT AGGTTTTGGT GGGGTGGAGA TTCATCCAGG 3900
GCGAAAGTAG GATGGCTGAG AACATTGGCT AGCAGATTTC CCCTACTACG TCTGGATTCT 3960
TATGCGGACA GTTTTAAGTT TCTGACTCGT CTCTCAAACG TTGAAGAATT TGAGCAAGAT 4020
TCTGTACCGA TATCACGTTT GAGAACGTTT TGGACTGAAG AGGACTTATT CGACCGGCTG 4080
GAGCATGAAG TGCAGACAGC CAAGACCAAG CGCTCGAAGA AGAAGGCGAA AGTCCCGCCA 4140
GCTGCTGAGA TACCTCAGGA GGAGTTTCAT GATGCCCCTG AGAGTTCGAG CCCTGAGTCC 4200
GTCAGTGATG ACGTTAAACC GGTGACTGAT GTGGTCCCGG ATGCCGAGGT GTCTGTTGAG 4260
GTACCAACGG ACCCTCGTGG CATATCTAGA CACGGAGCCA TGAAGGAATT TGTGCGTTAT 4320
TGTAAGAGAT TACATAACAA CTCCGAGTCT AATCTTCGTC ACCTATGGGA CATTTCCGGC 4380
GGTCGCGGAA GTGAGATCGC AAATAAGAGC ATCTTTGAGA CCTACCATCG CATAGACGAT 4440
ATGGTGAATG TCCATTTGGC CAACGGTAAC TGGTTGTATC CTAAAAAATA CGATTACACT 4500
GTTGGATATA ATGAGCATGG TTTAGGTCCG AAGCACGCAG ATGAAACGTA CATTGTTGAT 4560
AAAACATGTG CATGCTCTAA CTTGAGGGAC ATTGCAGAAG CTAGCGCCAA AGTTTCTGTC 4620
CCTACATGCG ATATTTCCAT GGTTGATGGA GTTGCGGGAT GCGGTAAAAC CACTGCCATA 4680
AAAGATGCAT TCCGTATGGG AGAGGACCTA ATTGTGACGG CGAATCGTAA ATCGGCCGAG 4740
GACGTCAGGA TGGCTTTATT CCCTGACACT TATAATTCCA AGGTAGCTTT GGACGTTGTG 4800
CGCACCGCGG ATTCTGCGAT CATGCACGGT GTACCGTCCT GTCATAGGCT GCTTGTTGAT 4860
GAGGCTGGTT TACTACATTA TGGTCAACTC CTGGTGGTGG CTGCTCTGTC TAAATGTTCA 4920
CAAGTTCTTG CCTTTGGGGA CACAGAGCAG ATTTCGTTCA AGTCTCGTGA CGCGGGTTTT 4980
AAATTGCTCC ACGGTAATCT GCAATATGAT CGCCGTGACG TTGTTCACAA GACTTACCGG 5040
TGTCCGCAAG ATGTTATCGC TGCTGTTAAT CTGCTGAAGC GTAAATGCGG TAATAGGGAC 5100
ACGAAGTATC AATCCTGGAC ATCTGAGTCC AAAGTTTCTA GAAGTCTCAC GAAGCGTCGT 5160
ATTACTTCTG GTTTGCAGGT CACTATTGAT CCGAACAGAA CGTATCTTAC GATGACTCAA 5220
GCTGATAAAG CGGCCCTTCA AACGAGGGCT AAGGATTTTC CCGTGAGCAA GGACTGGATT 5280
GATGGACACA TAAAAACAGT AGACGAAGCG CAAGGGATCT CTGTTGACAA CGTCACTTTG 5340
GTTCGGCTTA AGTCGACCAA ATGTGATTTG TTTAAACATG AGGAGTACTG TTTGGTTGCC 5400
TTAACACGAC ACAAGAAGTC CTTTGAGTAT TGCTTTAACG GCGAGCTCGC TGGTGATTTG 5460
ATCTTTAATT GTGTTAAGTG ATGCGCTTGT CTCTGTGTGA GACCTCTGCT CGAGAATTCG 5520
AGCTCGGTAC CCGGGGATCC TCTAGAGTCC GCAAATCACC AGTCTCTCTC TACAAATCTA 5580
TCTCTCTCTA TTTTCTCCAG AATAATGTGT GAGTAGTTCC CAGATAAGGG AATTAGGGTT 5640
CTTATAGGGT TTCGCTCATG TGTTGAGCAT ATAAGAAACC CTTAGTATGT ATTTGTATTT 5700
GTAAAATACT TCTATCAATA AAATTTCTAA TTCCTAAAAC CAAAATCCAG TGACCGGGTG 5760
GTCAGTCCCT TATGTTACGT CCTGTAGAAA CCCCAACCCG TGAAATCAAA AAACTCGACG 5820
GCCTGTGGGC ATTCAGTCTG GATCGCGAAA ACTGTGGAAT TGATCAGCGT TGGTGGGAAA 5880
GCGCGTTACA AGAAAGCCGG GCAATTGCTG TGCCAGGCAG TTTTAACGAT CAGTTCGCCG 5940
ATGCAGATAT TCGTAATTAT GCGGGCAACG TCTGGTATCA GCGCGAAGTC TTTATACCGA 6000
AAGGTTGGGC AGGCCAGCGT ATCGTGCTGC GTTTCGATGC GGTCACTCAT TACGGCAAAG 6060
TGTGGGTCAA TAATCAGGAA GTGATGGAGC ATCAGGGCGG CTATACGCCA TTTGAAGCCG 6120
ATGTCACGCC GTATGTTATT GCCGGGAAAA GTGTACAATT CACTGGCCGT CGTTTTACAA 6180
CGTCGTGACT GGGAAAACCC TGGCGTTACC CAACTTAATC GCCTTGCAGC ACATCCCCCT 6240
TTCGCCAGCT GGCGTAATAG CGAAGAGGCC CGCACCGATC GCCCTTCCCA ACAGTTGCGC 6300
AGCCTGAATG GCGAATGNNN NNNNAATTCA GTACATTAAA AACGTCCGCA ATGTGTTATT 6360
AAGTTGTCTA AGCGTCAATT TGTTTACACC ACAATATATC CTGCCACCAG CCAGCCAACA 6420
GCTCCCCGAC CGGCAGCTCG GCACAAAATC ACCACTCGAT ACAGGCAGCC CATCAGNNNN 6480
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 6540
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 6600
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 6660
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 6720
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 6750


<210> 3 
<211> 6426 
<212> DNA 

<213> Brome mosaic virus 


<400> 3 

AAACACTGAT AGTTTAAACT GAAGGCGGGA AACGACAATC TGATCATGAG CGGAGAATTA 60 
AGGGAGTCAC GTTATGACCC CCGCCGATGA CGCGGGACAA GCCGTTTTAC GTTTGGAACT 120

515 

GACAGAACCG CAACGATTGA AGGAGCCACT CAGCCGCGGG TTTCTGGAGT TTAATGAGCT 180 
AAGCACATAC GTCAGAAACC ATTATTGCGC GTTCAAAAGT CGCCTAAGGT CACTATCAGC 240 
TAGCAAATAT TTCTTGTCAA AAATGCTCCA CTGACGTTCC ATAAATTCCC CTCGGTATCC 300
AATTAGNNNN NNNNNNNNNN NNNNNNNNNN GATCGTTTCG CATGATTGAA CAAGATGGAT 360
TGCACGCAGG TTCTCCGGCC GCTTGGGTGG AGAGGCTATT CGGCTATGAC TGGGCACAAC 420

525 

AGACAATCGG CTGCTCTGAT GCCGCCGTGT TCCGGCTGTC AGCGCAGGGG CGCCCGGTTC 480
TTTTTGTCAA GACCGACCTG TCCGGTGCCC TGAATGAACT GCAGGACGAG GCAGCGCGGC 540 
530 TATCGTGGCT GGCCACGACG GGCGTTCCTT GCGCAGCTGT GCTCGACGTT GTCACTGAAG 600 
CGGGAAGGGA CTGGCTGCTA TTGGGCGAAG TGCCGGGGCA GGATCTCCTG TCATCTCACC 660 
TTGCTCCTGC CGAGAAAGTA TCCATCATGG CTGATGCAAT GCGGCGGCTG CATACGCTTG 720 

535 

ATCCGGCTAC CTGCCCATTC GACCACCAAG CGAAACATCG CATCGAGCGA GCACGTACTC 780 
GGATGGAAGC CGGTCTTGTC GATCAGGATG ATCTGGACGA AGAGCATCAG GGGCTCGCGC 840 
540 CAGCCGAACT GTTCGCCAGG CTCAAGGCGC GCATGCCCGA CGGCGATGAT CTCGTCGTGA 900 
CCCATGGCGA TGCCTGCTTG CCGAATATCA TGGTGGAAAA TGGCCGCTTT TCTGGATTCA 960 
TCGACTGTGG CCGGCTGGGT GTGGCGGACC GCTATCAGGA CATAGCGTTG GCTACCCGTG 1020 

545 

ATATTGCTGA AGAGCTTGGC GGCGAATGGG CTGACCGCTT CCTCGTGCTT TACGGTATCG 1080 
CCGCTCCCGA TTCGCAGCGC ATCGCCTTCT ATCGCCTTCT TGACGAGTTC TTCTGANNNN 1140
NNNNNNNNNN NNNNNNNNNN GATCGTTCAA ACATTTGGCA ATAAAGTTTC TTAAGATTGA 1200
ATCCTGTTGC CGGTCTTGCG ATGATTATCA TATAATTTCT GTTGAATTAC GTTAAGCATG 1260
TAATAATTAA CATGTAATGC ATGACGTTAT TTATGAGATG GGTTTTTATG ATTAGAGTCC 1320
CGCAATTATA CATTTAATAC GCGATAGAAA ACAAAATATA GCGCGCAAAC TAGGATAAAT 1380
TATCGCGCGC GGTGTCATCT ATGTTACTAG ATCGGGCCTC CTGTCAATGC TGGCGGCGGC 1440
TCTGGTGGTG GTTCTGGTGG CGGCTCTGAG GGTGGTGGCT CTGAGGGTGG CGGTTCTGAG 1500
GGTGGCGGCT CTGAGGGAGG CGGTTCCGGT GGTGGCTCTG GTTCCGGTGA TTTTGATTAT 1560
GAAAAGATGG CAAACGCTAA TAAGGGGGCT ATGACCGAAA ATGCCGATGA AAACGCGCTA 1620
CAGTCTGACG CTAAAGGCAA ACTTGATTCT GTCGCTACTG ATTACGGTGC TGCTATCGAT 1680
GGTTTCATTG GTGACGTTTC CGGCCTTGCT AATGGTAATG GTGCTACTGG TGATTTTGCT 1740
GGCTCTAATT CCCAAATGGC TCAAGTCGGT GACGGTGATA ATTCACCTTT AATGAATAAT 1800
TTCCGTCAAT ATTTACCTTC CCTCCCTCAA TCGGTTGAAT GTCGCCCTTT TGTCTTTGGC 1860
CCAATACGCA AACCGCCTCT CCCCGCGCGT TGGCCGATTC ATTAATGCAG CTGGCACGAC 1920
AGGTTTCCCG ACTGGAAAGC GGGCAGTGAG CGCAACGCAA TTAATGTGAG TTAGCTCACT 1980
CATTAGGCAC CCCAGGCTTT ACACTTTATG CTTCCGGCTC GTATGTTGTG TGGAATTGTG 2040
AGCGGATAAC AATTTCACAC AGGAAACAGC TATGACCATG ATTACGCCAA GCTTGCATGC 2100
CTGCAGGTCA CTGGATTTTG GTTTTAGGAA TTAGAAATTT TATTGATAGA AGTATTTTAC 2160
AAATACAAAT ACATACTAAG GGTTTCTTAT ATGCTCAACA CATGAGCGAA ACCCTATAAG 2220
AACCCTAATT CCCTTATCTG GGAACTACTC ACACATTATT CTGGAGAAAA TAGAGAGAGA 2280
TAGATTTGTA GAGAGAGACT GGTGATTTGC GGACTCTAGA GGATCCCCAG CTTTTAAACT 2340
TAGCCAAAGT GGTCTGCCTG ACCAGGAGTT TTTAACCTTA ACCAAAGGGC TGTTCACAGC 2400
TTAGGTTCAT ATATCATAGA ACCGATCATC TCAGATCAGA GGGCTTAAAA GTCTCACAAT 2460
GGGACTTCAC GAGCAAAGCA TCAACTGACG TTAGGCCTCC TCTACCGGTA GCGTAATCGT 2520
CGACCTTCTT TTTCAAGCGT TGTGTGGTCC TACGATCATT AGCTAATTTG AGTGACTCAC 2580
GCTCAAGGGC CTCATGTAAA CGTCCGATCC GTTTGACAGG GAGCTCCTTA GTACTACAGT 2640
CCGAGGAATA AATTCCAATG GTTCTGTAGA CTTTGTCTAA CACACCAGGA AACTTTGGAT 2700
TCTTCCAGTT GTGAAACCAG TCACCATCAG TTTTACGCTC TTCCGTGGTG CGTTTGAACT 2760
TACATACAGG ATCGCTCATC TGATAAACTC TGATGCCTTC GGTACAGTAG CAATCAGAGA 2820

ACCTCAGGAA ATTCTCGGAG TATAAAGAAA AAGCCGCAAG AGCAGCTCTA ACCTCCTCGA 2880 

AAATCCAAGG TTTTTCTTTC CCATATTTCA GATAAACAAA ATGACAGAGC GTCGTAATCA 2940

610 TCTTCTCATC AAGTTGATTA ATAAACTTCA TTCGATCACA GAAGGAAACG AAATGTGCTC 3000 

TGAGCATCTG TTCATCACGC AGAATCTTTC GCTTAGCTAA GCGCTGGATC TCTCTCAGAG 3060 

GATCTGGTAC AGACACCAAA TTGCCCATTT CAGTTTCGAC GAGAAACTTA CTACAAACGT 3120 

615 

AGGGCACACT AGGGTCCATG ACTTTTATCT CCATATTGAA GAGAGACGTA AACATATCGG 3180 

TATCCAGGAC TGGCTTAACT TTAGAGATGA TTAAAGAATC ATCTCCTGAA AATATTGCAC 3240

620 AGTCACAGTC ACTTAGATCA GAGGCATATG CAATCATAGC CATAGTGACA AGAGTATTAC 3300 

CGAAATATGT AAACGCGTCA CCAGTTCTGC GTTGGAAGGA AACGGACATT CCCACCTTGG 3360 

CATGAGGGTC TGATAAATAA GAATCGCGAT GAAAATCAGA CCACCAATTC GTCAGCGGCG 3420 

625 

CTGGAAAGCC CAGCGCAAGG AGTATCTCTC TCTGAAACTC TAGGTGCAGC TCACCCTGAG 3480 

ATTTATCAAA TTTGCTTAGG TCCGCTTCAA GAAAGTATCT GTTATTCAAG CGGACATTCT 3540 

630 TAAGCTCCAG AGAGGATATC TTTCCGATAG GCACAATGAA CCTGGATTTC AGGGCCAGTG 3600 

ATAACTTCTC GAAACAAGCA GTGAAAAAGG GTGAAAAATT ACTAGTCACA CCTTTACTAT 3660 

GAAATGTTAT AGTAGCTGCT ACTGCTCGTT CCAAGTGAAG GGTGTCAGTT ACAACAGGTT 3720

635 

TTACGTCAGA CTTCAGCATA TGCTGGTACC GACATAAATC AGTCTCTGCT GCCACATTCA 3780 

CACCTTGCAA GTCCATGTGC TTACCCCACT TCTTATGGTA CTCAAGACAT TTAGTCATGA 3840

640 CATCCATAGA AGCTCTCAGA CAGTCTTCAC CGTCAACATT AAGGAATGTG CTACGAAAGC 3900 

GCTTTGCTAT AGCTTTCGCA GTGTCCTTCA TGTTAATCGC GTCTCCCATT TCTGGAACGT 3960

CCGCGTTTCG CTTTTTGAGT GCGGTTAAGA CTTCTTTCTG AGTACCAACT CTTCGCTGAG 4020 

645 

CACTCCCGAT ATTCATTTTT GGTTGAAAAT ATTTATCGGG GTCCCTATAC CAGTCTACAT 4080 

CACTTTGCTT AAGTCTGATC CTATCAAAGT CCATGGAATA ATCACCATTT TCAACAAGGG 4140 

650 CTTGATGGTA CGAATCATCG AAATAAGCAT GGGTTGGCAG TATGGAATGA CTGGTCGCTT 4200 

CTGTTCTAGC AAGGCTGACT CTCTCCATAT AAATTGGCCC AGTAGAGATG TCAGGGTTAT 4260 

CTGGATGGCA GTGTGTATCA ATAACACGCG AAACCCTATG TTCAATAGGG TTCATGATTT 4320 

GAAGAGTGAT GTCGTAATCA GTATTAGTAG TCTGAAACTC TTCATCAATG CCCATGTACC 4380
TATCTCCAAG GGTCAGCTCC TTGGGGGTAT CTCCAGTAAC ACGAACTTCC TCAATTTCAC 4440
AGTTCGAGGA ATCACTGGCG AGTTTTAGAT CGCTCGCATG ATCTTCATCG GCGGCAAACG 4500
ATACACCGTA ACCATCACTA GTATCCTCGG GATACCAGTC ATCAATTTCA TCTTCGAGCA 4560 
CGAAAGAGCC CGGAATGTCA AGATATAACA TCCGTGCCAT TTCAGCTTGA GGAATCAGCG 4620 

665 

GTCTATCGGT GAACTGTTGA ACCATTTGTT GGACGGTGTC GCAAATAGAG CCCCAGCGCA 4680 
CTCGGTCAAA AGGGGGATCG AATACCCCTC CTATCTCCAA GGGCGCTATA GCTAATTTAA 4740 
670 AACTCGCGAG AGATCCGTCA ATGGCAACTC CGTCTGCCGG CTCCTGCACC TGAAGGCTAG 4800 
CAGCCTCCAC CTCGTCTTCT AAGGATTGAT CTATGATCCA TTGGAAAGAC GGGACCTGGC 4860 
GAACGAAATC ATCATCCCAG GTTTTCGAAG ACATCTTGGT GATAGTAGAA AGAACAAGCA 4920

675 

CACAACAACA ACAAGGTCAG ATGTGTGTTG CGGGTACCGA GCTCGAATTC TCGAGGTCCT 4980
CTCCAAATGA AATGAACTTC CTTATATAGA GGAAGGGTCT TGCGAAGGAT AGTGGGATTG 5040
TGCGTCATCC CTTACGTCAG TGGAGATATC ACATCAATCC ACTTGCTTTG AAGACGTGGT 5100
TGGAACGTCT TCTTTTTCCA CGATGTTCCT CGTGGGTGGG GGTCCATCTT TGGGACCACT 5160 
GTCGGTAGAG GCATTCTTGA ACGATAGCCT TTCCTTTATC GCAATGATGG CATTTGTAGA 5220 

685 

AGCCATCTTC CTTTTCTACT GTCCTTTCGA TGAAGTGACA GATAGCTGGG CAATGGAATC 5280 
CGAGGAGGTT TCCCGATATT ACCCTTTGTT GAAAAGTCTC AATAGCCCTC TGGTCTTCTG 5340
AGACTGTATC TTTGATATTC TTGGAGTAGA CGAGAGTGTC GTGCTCCACC ATGTTGACCT 5400
GCAGGCAGCA AGCTTGCATG CCTGCAGGTC GACTCTAGAG GATCCCCGGG TGGTCAGTCC 5460 
CTTATGTTAC GTCCTGTAGA AACCCCAACC CGTGAAATCA AAAAACTCGA CGGCCTGTGG 5520 

695 

GCATTCAGTC TGGATCGCGA AAACTGTGGA ATTGATCAGC GTTGGTGGGA AAGCGCGTTA 5580 
CAAGAAAGCC GGGCAATTGC TGTGCCAGGC AGTTTTAACG ATCAGTTCGC CGATGCAGAT 5640 
700 ATTCGTAATT ATGCGGGCAA CGTCTGGTAT CAGCGCGAAG TCTTTATACC GAAAGGTTGG 5700 
GCAGGCCAGC GTATCGTGCT GCGTTTCGAT GCGGTCACTC ATTACGGCAA AGTGTGGGTC 5760 
AATAATCAGG AAGTGATGGA GCATCAGGGC GGCTATACGC CATTTGAAGC CGATGTCACG 5820 

705 

CCGTATGTTA TTGCCGGGAA AAGTGTACAA TTCACTGGCC GTCGTTTTAC AACGTCGTGA 5880 
CTGGGAAAAC CCTGGCGTTA CCCAACTTAA TCGCCTTGCA GCACATCCCC CTTTCGCCAG 5940
CTGGCGTAAT AGCGAAGAGG CCCGCACCGA TCGCCCTTCC CAACAGTTGC GCAGCCTGAA 6000
TGGCGAATGN NNNNNNAATT CAGTACATTA AAAACGTCCG CAATGTGTTA TTAAGTTGTC 6060
TAAGCGTCAA TTTGTTTACA CCACAATATA TCCTGCCACC AGCCAGCCAA CAGCTCCCCG 6120
ACCGGCAGCT CGGCACAAAA TCACCACTCG ATACAGGCAG CCCATCAGNN NNNNNNNNNN 6180
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 6240
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 6300
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 6360
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 6420
NNNNNN 6426






<210> 4 
<211> 6500
<212> DNA 

<213> Brome mosaic virus 
<400> 4 

AAACACTGAT AGTTTAAACT GAAGGCGGGA AACGACAATC TGATCATGAG CGGAGAATTA 60
AGGGAGTCAC GTTATGACCC CCGCCGATGA CGCGGGACAA GCCGTTTTAC GTTTGGAACT 120 
GACAGAACCG CAACGATTGA AGGAGCCACT CAGCCGCGGG TTTCTGGAGT TTAATGAGCT 180 

740 

AAGCACATAC GTCAGAAACC ATTATTGCGC GTTCAAAAGT CGCCTAAGGT CACTATCAGC 240 
TAGCAAATAT TTCTTGTCAA AAATGCTCCA CTGACGTTCC ATAAATTCCC CTCGGTATCC 300 
745 AATTAGNNNN NNNNNNNNNN NNNNNNNNNN GATCGTTTCG CATGATTGAA CAAGATGGAT 360 
TGCACGCAGG TTCTCCGGCC GCTTGGGTGG AGAGGCTATT CGGCTATGAC TGGGCACAAC 420
AGACAATCGG CTGCTCTGAT GCCGCCGTGT TCCGGCTGTC AGCGCAGGGG CGCCCGGTTC 480

750 

TTTTTGTCAA GACCGACCTG TCCGGTGCCC TGAATGAACT GCAGGACGAG GCAGCGCGGC 540
TATCGTGGCT GGCCACGACG GGCGTTCCTT GCGCAGCTGT GCTCGACGTT GTCACTGAAG 600 
755 CGGGAAGGGA CTGGCTGCTA TTGGGCGAAG TGCCGGGGCA GGATCTCCTG TCATCTCACC 660 
TTGCTCCTGC CGAGAAAGTA TCCATCATGG CTGATGCAAT GCGGCGGCTG CATACGCTTG 720
ATCCGGCTAC CTGCCCATTC GACCACCAAG CGAAACATCG CATCGAGCGA GCACGTACTC 780 

760 

GGATGGAAGC CGGTCTTGTC GATCAGGATG ATCTGGACGA AGAGCATCAG GGGCTCGCGC 840 
CAGCCGAACT GTTCGCCAGG CTCAAGGCGC GCATGCCCGA CGGCGATGAT CTCGTCGTGA 900 
CCCATGGCGA TGCCTGCTTG CCGAATATCA TGGTGGAAAA TGGCCGCTTT TCTGGATTCA 960
TCGACTGTGG CCGGCTGGGT GTGGCGGACC GCTATCAGGA CATAGCGTTG GCTACCCGTG 1020
ATATTGCTGA AGAGCTTGGC GGCGAATGGG CTGACCGCTT CCTCGTGCTT TACGGTATCG 1080
CCGCTCCCGA TTCGCAGCGC ATCGCCTTCT ATCGCCTTCT TGACGAGTTC TTCTGANNNN 1140
NNNNNNNNNN NNNNNNNNNN GATCGTTCAA ACATTTGGCA ATAAAGTTTC TTAAGATTGA 1200
ATCCTGTTGC CGGTCTTGCG ATGATTATCA TATAATTTCT GTTGAATTAC GTTAAGCATG 1260
TAATAATTAA CATGTAATGC ATGACGTTAT TTATGAGATG GGTTTTTATG ATTAGAGTCC 1320
CGCAATTATA CATTTAATAC GCGATAGAAA ACAAAATATA GCGCGCAAAC TAGGATAAAT 1380
TATCGCGCGC GGTGTCATCT ATGTTACTAG ATCGGGCCTC CTGTCAATGC TGGCGGCGGC 1440
TCTGGTGGTG GTTCTGGTGG CGGCTCTGAG GGTGGTGGCT CTGAGGGTGG CGGTTCTGAG 1500
GGTGGCGGCT CTGAGGGAGG CGGTTCCGGT GGTGGCTCTG GTTCCGGTGA TTTTGATTAT 1560
GAAAAGATGG CAAACGCTAA TAAGGGGGCT ATGACCGAAA ATGCCGATGA AAACGCGCTA 1620
CAGTCTGACG CTAAAGGCAA ACTTGATTCT GTCGCTACTG ATTACGGTGC TGCTATCGAT 1680
GGTTTCATTG GTGACGTTTC CGGCCTTGCT AATGGTAATG GTGCTACTGG TGATTTTGCT 1740
GGCTCTAATT CCCAAATGGC TCAAGTCGGT GACGGTGATA ATTCACCTTT AATGAATAAT 1800
TTCCGTCAAT ATTTACCTTC CCTCCCTCAA TCGGTTGAAT GTCGCCCTTT TGTCTTTGGC 1860
CCAATACGCA AACCGCCTCT CCCCGCGCGT TGGCCGATTC ATTAATGCAG CTGGCACGAC 1920
AGGTTTCCCG ACTGGAAAGC GGGCAGTGAG CGCAACGCAA TTAATGTGAG TTAGCTCACT 1980
CATTAGGCAC CCCAGGCTTT ACACTTTATG CTTCCGGCTC GTATGTTGTG TGGAATTGTG 2040
AGCGGATAAC AATTTCACAC AGGAAACAGC TATGACCATG ATTACGCCAA GCTTGCTGCC 2100
TGCAGGTCAA CATGGTGGAG CACGACACTC TCGTCTACTC CAAGAATATC AAAGATACAG 2160
TCTCAGAAGA CCAGAGGGCT ATTGAGACTT TTCAACAAAG GGTAATATCG GGAAACCTCC 2220
TCGGATTCCA TTGCCCAGCT ATCTGTCACT TCATCGAAAG GACAGTAGAA AAGGAAGATG 2280
GCTTCTACAA ATGCCATCAT TGCGATAAAG GAAAGGCTAT CGTTCAAGAA TGCCTCTACC 2340
GACAGTGGTC CCAAAGATGG ACCCCCACCC ACGAGGAACA TCGTGGAAAA AGAAGACGTT 2400
CCAACCACGT CTTCAAAGCA AGTGGATTGA TGTGATATCT CCACTGACGT AAGGGATGAC 2460
GCACAATCCC ACTATCCTTC GCAAGACCCT TCCTCTATAT AAGGAAGTTC ATTTCATTTG 2520
GAGAGGACCT CGAGAATTCG AGCTCGGTAC CCGCAACACA CATCTGACCT TGTTGTTGTT 2580
GTGTGCTTGT TCTTTCTACT ATCACCAAGA TGTCTTCGAA AACCTGGGAT GATGATTTCG 2640
TTCGCCAGGT CCCGTCTTTC CAATGGATCA TAGATCAATC CTTAGAAGAC GAGGTGGAGG 2700 
CTGCTAGCCT TCAGGTGCAG GAGCCGGCAG ACGGAGTTGC CATTGACGGA TCTCTCGCGA 2760
GTTTTAAATT AGCTATAGCG CCCTTGGAGA TAGGAGGGGT ATTCGATCCC CCTTTTGACC 2820
GAGTGCGCTG GGGCTCTATT TGCGACACCG TCCAACAAAT GGTTCAACAG TTCACCGATA 2880 
GACCGCTGAT TCCTCAAGCT GAAATGGCAC GGATGTTATA TCTTGACATT CCGGGCTCTT 2940 
TCGTGCTCGA AGATGAAATT GATGACTGGT ATCCCGAGGA TACTAGTGAT GGTTACGGTG 3000
835 TATCGTTTGC CGCCGATGAA GATCATGCGA GCGATCTAAA ACTCGCCAGT GATTCCTCGA 3060 
ACTGTGAAAT TGAGGAAGTT CGTGTTACTG GAGATACCCC CAAGGAGCTG ACCCTTGGAG 3120 
ATAGGTACAT GGGCATTGAT GAAGAGTTTC AGACTACTAA TACTGATTAC GACATCACTC 3180 
TTCAAATCAT GAACCCTATT GAACATAGGG TTTCGCGTGT TATTGATACA CACTGCCATC 3240
CAGATAACCC TGACATCTCT ACTGGGCCAA TTTATATGGA GAGAGTCAGC CTTGCTAGAA 3300
CAGAAGCGAC CAGTCATTCC ATACTGCCAA CCCATGCTTA TTTCGATGAT TCGTACCATC 3360
AAGCCCTTGT TGAAAATGGT GATTATTCCA TGGACTTTGA TAGGATCAGA CTTAAGCAAA 3420 
GTGATGTAGA CTGGTATAGG GACCCCGATA AATATTTTCA ACCAAAAATG AATATCGGGA 3480
GTGCTCAGCG AAGAGTTGGT ACTCAGAAAG AAGTCTTAAC CGCACTCAAA AAGCGAAACG 3540
CGGACGTTCC AGAAATGGGA GACGCGATTA ACATGAAGGA CACTGCGAAA GCTATAGCAA 3600 
855 AGCGCTTTCG TAGCACATTC CTTAATGTTG ACGGTGAAGA CTGTCTGAGA GCTTCTATGG 3660 
ATGTCATGAC TAAATGTCTT GAGTACCATA AGAAGTGGGG TAAGCACATG GACTTGCAAG 3720
GTGTGAATGT GGCAGCAGAG ACTGATTTAT GTCGGTACCA GCATATGCTG AAGTCTGACG 3780
TAAAACCTGT TGTAACTGAC ACCCTTCACT TGGAACGAGC AGTAGCAGCT ACTATAACAT 3840
TTCATAGTAA AGGTGTGACT AGTAATTTTT CACCCTTTTT CACTGCTTGT TTCGAGAAGT 3900 
TATCACTGGC CCTGAAATCC AGGTTCATTG TGCCTATCGG AAAGATATCC TCTCTGGAGC 3960
TTAAGAATGT CCGCTTGAAT AACAGATACT TTCTTGAAGC GGACCTAAGC AAATTTGATA 4020 
AATCTCAGGG TGAGCTGCAC CTAGAGTTTC AGAGAGAGAT ACTCCTTGCG CTGGGCTTTC 4080 
CAGCGCCGCT GACGAATTGG TGGTCTGATT TTCATCGCGA TTCTTATTTA TCAGACCCTC 4140 
ATGCCAAGGT GGGAATGTCC GTTTCCTTCC AACGCAGAAC TGGTGACGCG TTTACATATT 4200
TCGGTAATAC TCTTGTCACT ATGGCTATGA TTGCATATGC CTCTGATCTA AGTGACTGTG 4260
ACTGTGCAAT ATTTTCAGGA GATGATTCTT TAATCATCTC TAAAGTTAAG CCAGTCCTGG 4320 
ATACCGATAT GTTTACGTCT CTCTTCAATA TGGAGATAAA AGTCATGGAC CCTAGTGTGC 4380 

880 

CCTACGTTTG TAGTAAGTTT CTCGTCGAAA CTGAAATGGG CAATTTGGTG TCTGTACCAG 4440 
ATCCTCTGAG AGAGATCCAG CGCTTAGCTA AGCGAAAGAT TCTGCGTGAT GAACAGATGC 4500
885 TCAGAGCACA TTTCGTTTCC TTCTGTGATC GAATGAAGTT TATTAATCAA CTTGATGAGA 4560 
AGATGATTAC GACGCTCTGT CATTTTGTTT ATCTGAAATA TGGGAAAGAA AAACCTTGGA 4620 
TTTTCGAGGA GGTTAGAGCT GCTCTTGCGG CTTTTTCTTT ATACTCCGAG AATTTCCTGA 4680

890 

GGTTCTCTGA TTGCTACTGT ACCGAAGGCA TCAGAGTTTA TCAGATGAGC GATCCTGTAT 4740 
GTAAGTTCAA ACGCACCACG GAAGAGCGTA AAACTGATGG TGACTGGTTT CACAACTGGA 4800
AGAATCCAAA GTTTCCTGGT GTGTTAGACA AAGTCTACAG AACCATTGGA ATTTATTCCT 4860
CGGACTGTAG TACTAAGGAG CTCCCTGTCA AACGGATCGG ACGTTTACAT GAGGCCCTTG 4920
AGCGTGAGTC ACTCAAATTA GCTAATGATC GTAGGACCAC ACAACGCTTG AAAAAGAAGG 4980

900 

TCGACGATTA CGCTACCGGT AGAGGAGGCC TAACGTCAGT TGATGCTTTG CTCGTGAAGT 5040 
CCCATTGTGA GACTTTTAAG CCCTCTGATC TGAGATGATC GGTTCTATGA TATATGAACC 5100 
905 TAAGCTGTGA ACAGCCCTTT GGTTAAGGTT AAAAACTCCT GGTCAGGCAG ACCACTTTGG 5160 
CTAAGTTTAA AAGCTGGGGA TCCTCTAGAG TCCGCAAATC ACCAGTCTCT CTCTACAAAT 5220 
CTATCTCTCT CTATTTTCTC CAGAATAATG TGTGAGTAGT TCCCAGATAA GGGAATTAGG 5280 

910 

GTTCTTATAG GGTTTCGCTC ATGTGTTGAG CATATAAGAA ACCCTTAGTA TGTATTTGTA 5340 
TTTGTAAAAT ACTTCTATCA ATAAAATTTC TAATTCCTAA AACCAAAATC CAGTGACCTG 5400 
915 CAGGCATGCA AGCTTGCATG CCTGCAGGTC GACTCTAGAG GATCCCCGGG TGGTCAGTCC 5460 
CTTATGTTAC GTCCTGTAGA AACCCCAACC CGTGAAATCA AAAAACTCGA CGGCCTGTGG 5520 
GCATTCAGTC TGGATCGCGA AAACTGTGGA ATTGATCAGC GTTGGTGGGA AAGCGCGTTA 5580 

920 

CAAGAAAGCC GGGCAATTGC TGTGCCAGGC AGTTTTAACG ATCAGTTCGC CGATGCAGAT 5640 
ATTCGTAATT ATGCGGGCAA CGTCTGGTAT CAGCGCGAAG TCTTTATACC GAAAGGTTGG 5700 
925 GCAGGCCAGC GTATCGTGCT GCGTTTCGAT GCGGTCACTC ATTACGGCAA AGTGTGGGTC 5760 
AATAATCAGG AAGTGATGGA GCATCAGGGC GGCTATACGC CATTTGAAGC CGATGTCACG 5820
CCGTATGTTA TTGCCGGGAA AAGTGTACAA TTCACTGGCC GTCGTTTTAC AACGTCGTGA 5880
CTGGGAAAAC CCTGGCGTTA CCCAACTTAA TCGCCTTGCA GCACATCCCC CTTTCGCCAG 5940
CTGGCGTAAT AGCGAAGAGG CCCGCACCGA TCGCCCTTCC CAACAGTTGC GCAGCCTGAA 6000
TGGCGAATGN NNNNNNAATT CAGTACATTA AAAACGTCCG CAATGTGTTA TTAAGTTGTC 6060
TAAGCGTCAA TTTGTTTACA CCACAATATA TCCTGCCACC AGCCAGCCAA CAGCTCCCCG 6120
ACCGGCAGCT CGGCACAAAA TCACCACTCG ATACAGGCAG CCCATCAGNN NNNNNNNNNN 6180
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 6240
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 6300
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 6360
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 6420
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 6480
NNNNNNNNNN NNNNNNNNNN 6500

<210> 5
<211> 10100
<212> DNA

<213> Brome mosaic virus
<400> 5

AAACACTGAT AGTTTAAACT GAAGGCGGGA AACGACAATC TGATCATGAG CGGAGAATTA 60
AGGGAGTCAC GTTATGACCC CCGCCGATGA CGCGGGACAA GCCGTTTTAC GTTTGGAACT 120
GACAGAACCG CAACGATTGA AGGAGCCACT CAGCCGCGGG TTTCTGGAGT TTAATGAGCT 180
AAGCACATAC GTCAGAAACC ATTATTGCGC GTTCAAAAGT CGCCTAAGGT CACTATCAGC 240
TAGCAAATAT TTCTTGTCAA AAATGCTCCA CTGACGTTCC ATAAATTCCC CTCGGTATCC 300
AATTAGNNNN NNNNNNNNNN NNNNNNNNNN GATCGTTTCG CATGATTGAA CAAGATGGAT 360
TGCACGCAGG TTCTCCGGCC GCTTGGGTGG AGAGGCTATT CGGCTATGAC TGGGCACAAC 420
AGACAATCGG CTGCTCTGAT GCCGCCGTGT TCCGGCTGTC AGCGCAGGGG CGCCCGGTTC 480
TTTTTGTCAA GACCGACCTG TCCGGTGCCC TGAATGAACT GCAGGACGAG GCAGCGCGGC 540
TATCGTGGCT GGCCACGACG GGCGTTCCTT GCGCAGCTGT GCTCGACGTT GTCACTGAAG 600
CGGGAAGGGA CTGGCTGCTA TTGGGCGAAG TGCCGGGGCA GGATCTCCTG TCATCTCACC 660
TTGCTCCTGC CGAGAAAGTA TCCATCATGG CTGATGCAAT GCGGCGGCTG CATACGCTTG 720
ATCCGGCTAC CTGCCCATTC GACCACCAAG CGAAACATCG CATCGAGCGA GCACGTACTC 780


GGATGGAAGC CGGTCTTGTC GATCAGGATG ATCTGGACGA AGAGCATCAG GGGCTCGCGC 840
CAGCCGAACT GTTCGCCAGG CTCAAGGCGC GCATGCCCGA CGGCGATGAT CTCGTCGTGA 900
CCCATGGCGA TGCCTGCTTG CCGAATATCA TGGTGGAAAA TGGCCGCTTT TCTGGATTCA 960
TCGACTGTGG CCGGCTGGGT GTGGCGGACC GCTATCAGGA CATAGCGTTG GCTACCCGTG 1020
ATATTGCTGA AGAGCTTGGC GGCGAATGGG CTGACCGCTT CCTCGTGCTT TACGGTATCG 1080
CCGCTCCCGA TTCGCAGCGC ATCGCCTTCT ATCGCCTTCT TGACGAGTTC TTCTGANNNN 1140
NNNNNNNNNN NNNNNNNNNN GATCGTTCAA ACATTTGGCA ATAAAGTTTC TTAAGATTGA 1200
ATCCTGTTGC CGGTCTTGCG ATGATTATCA TATAATTTCT GTTGAATTAC GTTAAGCATG 1260
TAATAATTAA CATGTAATGC ATGACGTTAT TTATGAGATG GGTTTTTATG ATTAGAGTCC 1320
CGCAATTATA CATTTAATAC GCGATAGAAA ACAAAATATA GCGCGCAAAC TAGGATAAAT 1380
TATCGCGCGC GGTGTCATCT ATGTTACTAG ATCGGGCCTC CTGTCAATGC TGGCGGCGGC 1440
TCTGGTGGTG GTTCTGGTGG CGGCTCTGAG GGTGGTGGCT CTGAGGGTGG CGGTTCTGAG 1500
GGTGGCGGCT CTGAGGGAGG CGGTTCCGGT GGTGGCTCTG GTTCCGGTGA TTTTGATTAT 1560
GAAAAGATGG CAAACGCTAA TAAGGGGGCT ATGACCGAAA ATGCCGATGA AAACGCGCTA 1620
CAGTCTGACG CTAAAGGCAA ACTTGATTCT GTCGCTACTG ATTACGGTGC TGCTATCGAT 1680
GGTTTCATTG GTGACGTTTC CGGCCTTGCT AATGGTAATG GTGCTACTGG TGATTTTGCT 1740
GGCTCTAATT CCCAAATGGC TCAAGTCGGT GACGGTGATA ATTCACCTTT AATGAATAAT 1800
TTCCGTCAAT ATTTACCTTC CCTCCCTCAA TCGGTTGAAT GTCGCCCTTT TGTCTTTGGC 1860
CCAATACGCA AACCGCCTCT CCCCGCGCGT TGGCCGATTC ATTAATGCAG CTGGCACGAC 1920
AGGTTTCCCG ACTGGAAAGC GGGCAGTGAG CGCAACGCAA TTAATGTGAG TTAGCTCACT 1980
CATTAGGCAC CCCAGGCTTT ACACTTTATG CTTCCGGCTC GTATGTTGTG TGGAATTGTG 2040
AGCGGATAAC AATTTCACAC AGGAAACAGC TATGACCATG ATTACGCCAA GCTTGCATGC 2100
CTGCAGGTCA CTGGATTTTG GTTTTAGGAA TTAGAAATTT TATTGATAGA AGTATTTTAC 2160
AAATACAAAT ACATACTAAG GGTTTCTTAT ATGCTCAACA CATGAGCGAA ACCCTATAAG 2220
AACCCTAATT CCCTTATCTG GGAACTACTC ACACATTATT CTGGAGAAAA TAGAGAGAGA 2280
TAGATTTGTA GAGAGAGACT GGTGATTTGC GGACTCTAGA GGATCCCCAG CTTTTAAACT 2340
TAGCCAAAGT GGTCTGCCTG ACCAGGAGTT TTTAACCTTA ACCAAAGGGC TGTTCACAGC 2400
1040 TTAGGTTCAT ATATCATAGA ACCGATCATC TCAGATCAGA GGGCTTAAAA GTCTCACAAT 2460 
GGGACTTCAC GAGCAAAGCA TCAACTGACG TTAGGCCTCC TCTACCGGTA GCGTAATCGT 2520 
CGACCTTCTT TTTCAAGCGT TGTGTGGTCC TACGATCATT AGCTAATTTG AGTGACTCAC 2580

1045 

GCTCAAGGGC CTCATGTAAA CGTCCGATCC GTTTGACAGG GAGCTCCTTA GTACTACAGT 2640
CCGAGGAATA AATTCCAATG GTTCTGTAGA CTTTGTCTAA CACACCAGGA AACTTTGGAT 2700 
1050 TCTTCCAGTT GTGAAACCAG TCACCATCAG TTTTACGCTC TTCCGTGGTG CGTTTGAACT 2760 
TACATACAGG ATCGCTCATC TGATAAACTC TGATGCCTTC GGTACAGTAG CAATCAGAGA 2820 
ACCTCAGGAA ATTCTCGGAG TATAAAGAAA AAGCCGCAAG AGCAGCTCTA ACCTCCTCGA 2880 

1055 

AAATCCAAGG TTTTTCTTTC CCATATTTCA GATAAACAAA ATGACAGAGC GTCGTAATCA 2940 
TCTTCTCATC AAGTTGATTA ATAAACTTCA TTCGATCACA GAAGGAAACG AAATGTGCTC 3000 
1060 TGAGCATCTG TTCATCACGC AGAATCTTTC GCTTAGCTAA GCGCTGGATC TCTCTCAGAG 3060 
GATCTGGTAC AGACACCAAA TTGCCCATTT CAGTTTCGAC GAGAAACTTA CTACAAACGT 3120 
AGGGCACACT AGGGTCCATG ACTTTTATCT CCATATTGAA GAGAGACGTA AACATATCGG 3180 

1065 

TATCCAGGAC TGGCTTAACT TTAGAGATGA TTAAAGAATC ATCTCCTGAA AATATTGCAC 3240 
AGTCACAGTC ACTTAGATCA GAGGCATATG CAATCATAGC CATAGTGACA AGAGTATTAC 3300
CGAAATATGT AAACGCGTCA CCAGTTCTGC GTTGGAAGGA AACGGACATT CCCACCTTGG 3360
CATGAGGGTC TGATAAATAA GAATCGCGAT GAAAATCAGA CCACCAATTC GTCAGCGGCG 3420
CTGGAAAGCC CAGCGCAAGG AGTATCTCTC TCTGAAACTC TAGGTGCAGC TCACCCTGAG 3480 

1075 

ATTTATCAAA TTTGCTTAGG TCCGCTTCAA GAAAGTATCT GTTATTCAAG CGGACATTCT 3540 
TAAGCTCCAG AGAGGATATC TTTCCGATAG GCACAATGAA CCTGGATTTC AGGGCCAGTG 3600 
ATAACTTCTC GAAACAAGCA GTGAAAAAGG GTGAAAAATT ACTAGTCACA CCTTTACTAT 3660
GAAATGTTAT AGTAGCTGCT ACTGCTCGTT CCAAGTGAAG GGTGTCAGTT ACAACAGGTT 3720
TTACGTCAGA CTTCAGCATA TGCTGGTACC GACATAAATC AGTCTCTGCT GCCACATTCA 3780 

1085 

CACCTTGCAA GTCCATGTGC TTACCCCACT TCTTATGGTA CTCAAGACAT TTAGTCATGA 3840 
CATCCATAGA AGCTCTCAGA CAGTCTTCAC CGTCAACATT AAGGAATGTG CTACGAAAGC 3900 
GCTTTGCTAT AGCTTTCGCA GTGTCCTTCA TGTTAATCGC GTCTCCCATT TCTGGAACGT 3960
CCGCGTTTCG CTTTTTGAGT GCGGTTAAGA CTTCTTTCTG AGTACCAACT CTTCGCTGAG 4020
CACTCCCGAT ATTCATTTTT GGTTGAAAAT ATTTATCGGG GTCCCTATAC CAGTCTACAT 4080

1095 

CACTTTGCTT AAGTCTGATC CTATCAAAGT CCATGGAATA ATCACCATTT TCAACAAGGG 4140
CTTGATGGTA CGAATCATCG AAATAAGCAT GGGTTGGCAG TATGGAATGA CTGGTCGCTT 4200
1100 CTGTTCTAGC AAGGCTGACT CTCTCCATAT AAATTGGCCC AGTAGAGATG TCAGGGTTAT 4260 
CTGGATGGCA GTGTGTATCA ATAACACGCG AAACCCTATG TTCAATAGGG TTCATGATTT 4320 
GAAGAGTGAT GTCGTAATCA GTATTAGTAG TCTGAAACTC TTCATCAATG CCCATGTACC 4380 

1105 

TATCTCCAAG GGTCAGCTCC TTGGGGGTAT CTCCAGTAAC ACGAACTTCC TCAATTTCAC 4440
AGTTCGAGGA ATCACTGGCG AGTTTTAGAT CGCTCGCATG ATCTTCATCG GCGGCAAACG 4500
ATACACCGTA ACCATCACTA GTATCCTCGG GATACCAGTC ATCAATTTCA TCTTCGAGCA 4560
CGAAAGAGCC CGGAATGTCA AGATATAACA TCCGTGCCAT TTCAGCTTGA GGAATCAGCG 4620
GTCTATCGGT GAACTGTTGA ACCATTTGTT GGACGGTGTC GCAAATAGAG CCCCAGCGCA 4680 

1115 

CTCGGTCAAA AGGGGGATCG AATACCCCTC CTATCTCCAA GGGCGCTATA GCTAATTTAA 4740
AACTCGCGAG AGATCCGTCA ATGGCAACTC CGTCTGCCGG CTCCTGCACC TGAAGGCTAG 4800
CAGCCTCCAC CTCGTCTTCT AAGGATTGAT CTATGATCCA TTGGAAAGAC GGGACCTGGC 4860
GAACGAAATC ATCATCCCAG GTTTTCGAAG ACATCTTGGT GATAGTAGAA AGAACAAGCA 4920
CACAACAACA ACAAGGTCAG ATGTGTGTTG CGGGTACCGA GCTCGAATTC TCGAGGTCCT 4980

1125 

CTCCAAATGA AATGAACTTC CTTATATAGA GGAAGGGTCT TGCGAAGGAT AGTGGGATTG 5040
TGCGTCATCC CTTACGTCAG TGGAGATATC ACATCAATCC ACTTGCTTTG AAGACGTGGT 5100
TGGAACGTCT TCTTTTTCCA CGATGTTCCT CGTGGGTGGG GGTCCATCTT TGGGACCACT 5160
GTCGGTAGAG GCATTCTTGA ACGATAGCCT TTCCTTTATC GCAATGATGG CATTTGTAGA 5220
AGCCATCTTC CTTTTCTACT GTCCTTTCGA TGAAGTGACA GATAGCTGGG CAATGGAATC 5280

1135 

CGAGGAGGTT TCCCGATATT ACCCTTTGTT GAAAAGTCTC AATAGCCCTC TGGTCTTCTG 5340
AGACTGTATC TTTGATATTC TTGGAGTAGA CGAGAGTGTC GTGCTCCACC ATGTTGACCT 5400
GCAGGCAGCA AGCTTGCATG CCTGCAGGTC GACTCTAGAG GATCCCCGGT CAACATGGTG 5460
GAGCACGACA CTCTCGTCTA CTCCAAGAAT ATCAAAGATA CAGTCTCAGA AGACCAGAGG 5520 
GCTATTGAGA CTTTTCAACA AAGGGTAATA TCGGGAAACC TCCTCGGATT CCATTGCCCA 5580
GCTATCTGTC ACTTCATCGA AAGGACAGTA GAAAAGGAAG ATGGCTTCTA CAAATGCCAT 5640
CATTGCGATA AAGGAAAGGC TATCGTTCAA GAATGCCTCT ACCGACAGTG GTCCCAAAGA 5700
TGGACCCCCA CCCACGAGGA ACATCGTGGA AAAAGAAGAC GTTCCAACCA CGTCTTCAAA 5760
GCAAGTGGAT TGATGTGATA TCTCCACTGA CGTAAGGGAT GACGCACAAT CCCACTATCC 5820
TTCGCAAGAC CCTTCCTCTA TATAAGGAAG TTCATTTCAT TTGGAGAGGA CCTCGACCAC 5880
GGTTCTGCTA CTTGTTCTTT GTTTTTCACC AACAAAATGT CAAGTTCTAT CGATTTGCTG 5940
AAGTTGATTG CTGAGAAGGG TGCTGACAGC CAGAGTGCCC AAGACATCGT AGACAATCAG 6000
GTTGCGCAAC AGTTATCTGC GCAGATTGAA TACGCGAAAA GGTCTAAGAA AATCAACGTT 6060
CGCAATAAGC TCTCTATTGA GGAGGCTGAC GCCTTCCGTG ACCGTTATGG TGGTGCCTTT 6120
GACTTAAATT TGACTCAGCA GTATCATGCG CCCCATAGCC TGGCTGGTGC TCTGCGTGTA 6180
GCGGAGCATT ATGACTGTCT CGACAGTTTT CCCCCTGAAG ACCCCGTTAT AGATTTCGGA 6240
GGGTCTTGGT GGCATCACTT TTCAAGAAGG GATAAAAGGG TGCACAGTTG TTGTCCTGTG 6300
TTGGGTGTTA GAGACGCTGC CCGACATGAG GAGAGGATGT GCCGCATGCG AAAAATTTTG 6360
CAAGAAAGCG ATGATTTCGA TGAAGTCCCG AACTTTTGTC TTAACCGAGC TCAAGATTGT 6420
GATGTCCAAG CTGATTGGGC TATCTGTATC CACGGCGGTT ATGATATGGG CTTCCAAGGT 6480
CTGTGTGACG CCATGCATTC GCATGGAGTA CGCGTACTAC GTGGTACCGT TATGTTCGAC 6540
GGCGCCATGT TGTTTGACCG CGAGGGTTTT CTTCCCTTGC TTAAATGTCA CTGGCAACGT 6600
GACGGGTCAG GCGCGGATGA GGTGATCAAA TTCGATTTTG AAAATGAAAG CACATTATCT 6660
TACATCCACG GATGGCAAGA TTTGGGCTCA TTTTTCACCG AGTCGGTGCA TTGCATCGAT 6720
GGAACCACCT ATCTGTTGGA GCGCGAAATG CTGAAATGTA ACATCATGAC CTATAAGATC 6780
ATCGCTACAA ATTTACGCTG CCCCCGGGAG ACACTACGTC ACTGTGTATG GTTTGAAGAC 6840
ATATCTAAGT ACGTAGGGGT CTCAATACCT GAAGACTGGA GTCTCAATCG CTGGAAATGT 6900
GTGCGCGTCG CCAAAACCAC AGTGAGAGAG GTAGAGGAGA TAGCTTTCAG ATGTTTCAAG 6960
GAAAGTAAAG AATGGACTGA GAACATGAAA GCTGTCGCAT CTATCTTATC CGCCAAGTCG 7020
TCGACTGTTA TTATTAACGG TCAGGCTATC ATGGCTGGTG AGCGCTTAGA CATTGAAGAT 7080
TATCATCTAG TGGCCTTTGC TTTGACTTTG AATCTGTATC AAAAGTACGA AAAGCTTACG 7140
GCCCTCCGCG ATGGGATGGA ATGGAAAGGT TGGTGCCATC ACTTCAAAAC TAGGTTTTGG 7200
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1200 TGGGGTGGAG ATTCATCCAG GGCGAAAGTA 
CCCCTACTAC GTCTGGATTC TTATG CGGAC 
GTTGAAGAAT TTGAGCAAGA TTCTGTACCG 

1205 

GAGGACTTAT TCGACCGGCT GGAGCATGAA 
AAGAAGGCGA AAGTCCCGCC AGCTGCTGAG 
1210 GAGAGTTCGA GCCCTGAGTC CGTCAGTGAT 
GATGCCGAGG TGTCTGTTGA GGTACCAACG 
ATGAAGGAAT TTGTGCGTTA TTGTAAGAGA 

1215 

CACCTATGGG ACATTTCCGG CGGTCGCGGA 
ACCTACCATC GCATAGACGA TATGGTGAAT 
1220 CCTAAAAAAT ACGATTACAC TGTTGGATAT 
GATGAAACGT ACATTGTTGA TAAAACATGT 
GCTAGCGCCA AAGTTTCTGT CCCTACATGC 

1225 

TGCGGTAAAA CCACTGCCAT AAAAGATGCA 
GCGAATCGTA AATCGGCCGA GGACGTCAGG 
1230 AAGGTAGCTT TGGACGTTGT GCGCACCGCG 
TGTCATAGGC TGCTTGTTGA TGAGG CTGGT 
GCTGCTCTGT CTAAATGTTC ACAAGTTCTT 

1235 

AAGTCTCGTG ACGCGGGTTT TAAATTGCTC 
GTTGTTCACA AGACTTACCG GTGTCCGCAA 
1240 CGTAAATGCG GTAATAGGGA CACGAAGTAT 
AGAAGTCTCA CGAAGCGTCG TATTACTTCT 
ACGTATCTTA CGATGACTCA AGCTGATAAA 

1245 

CCCGTGAGCA AGGACTGGAT TGATGGACAC 
TCTGTTGACA ACGTCACTTT GGTTCGGCTT 
1250 GAGGAGTACT GTTTGGTTGC CTTAACACGA 
GGCGAGCTCG CTGGTGATTT GATCTTTAAT 
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GGATGGCTGA GAACATTGGC TAGCAGATTT 7260 
AGTTTTAAGT TTCTGACTCG TCTCTCAAAC 7320 
ATATCACGTT TGAGAACGTT TTGGACTGAA 7380 
GTGCAGACAG CCAAGACCAA GCGCTCGAAG 7440 
ATACCTCAGG AGGAGTTTCA TGATGCCCCT 7500 
GACGTTAAAC CGGTGACTGA TGTGGTCCCG 7560 
GACCCTCGTG GCATATCTAG ACACGGAGCC 7620 
TTACATAACA ACTCCGAGTC TAATCTTCGT 768 0 
AGTGAGATCG CAAATAAGAG CATCTTTGAG 7740 
GTCCATTTGG CCAACGGTAA CTGGTTGTAT 7800 
AATGAGCATG GTTTAGGTCC GAAGCACGCA 7860 
GCATGCTCTA ACTTGAGGGA CATTGCAGAA 7920 
GATATTTCCA TGGTTGATGG AGTTGCGGGA 7980 
TTCCGTATGG GAGAGGACCT AATTGTGACG 8040 
ATGGCTTTAT TCCCTGACAC TTATAATTCC 8100 
GATTCTGCGA TCATGCACGG TGTACCGTCC 8160 
TTACTACATT ATGGTCAACT CCTGGTGGTG 8220 
GCCTTTGGGG ACACAGAGCA GATTTCGTTC 828 0 
CACGGTAATC TGCAATATGA TCGCCGTGAC 834 0 
GATGTTATCG CTGCTGTTAA TCTGCTGAAG 8400 
CAATCCTGGA CATCTGAGTC CAAAGTTTCT 8460 
GGTTTGCAGG TCACTATTGA TCCGAACAGA 8520 
GCGGCCCTTC AAACGAGGGC TAAGGATTTT 8580 
ATAAAAACAG TACACGAAGC GCAAGGGATC 8640 
AAGTCGACCA AATGTGATTT GTTTAAACAT 8700 
CACAAGAAGT CCTTTGAGTA TTGCTTTAAC 8760 
TGTGTTAAGT GATG CGCTTG TCTCTGTGTG 8820 
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AGACCTCTGC TCGAGAATTC GAGCTCGGTA CCCGGGGATC CTCTAGAGTC CGCAAATCAC 8 880 

1255 

CAGTCTCTCT CTACAAATCT ATCTCTCTCT ATTTTCTCCA GAATAATGTG TGAGTAGTTC 8 940 
CCAGATAAGG GAATTAGGGT TCTTATAGGG TTTCGCTCAT GTGTTGAGCA TATAAGAAAC 9000 
1260 CCTTAGTATG TATTTGTATT TGTAAAATAC TTCTATCAAT AAAATTTCTA ATTCCTAAAA 9060 
CCAAAATCCA GTGACCGGGT GGTCAGTCCC TTATGTTACG TCCTGTAGAA ACCCCAACCC 9X20 
GTGAAATCAA AAAACTCGAC GGCCTGTGGG CATTCAGTCT GGATCGCGAA AACTGTGGAA 918 0 

1265 

TTGATCAGCG TTGGTGGGAA AGCGCGTTAC AAGAAAGCCG GGCAATTGCT GTGCCAGGCA 924 0 
GTTTTAACGA TCAGTTCGCC GATGCAGATA TTCGTAATTA TGCGGGCAAC GTCTGGTATC 9300 
1270 AGCGCGAAGT CTTTATACCG AAAGGTTGGG CAGGCCAGCG TATCGTGCTG CGTTTCGATG 9360 
CGGTCACTCA TTACGGCAAA GTGTGGGTCA ATAATCAGGA AGTGATGGAG CATCAGGGCG 942 0 
GCTATACGCC ATTTGAAGCC GATGTCACGC CGTATGTTAT TGCCGGGAAA AGTGTACAAT 9480 

1275 

TCACTGGCCG TCGTTTTACA ACGTCGTGAC TGGGAAAACC CTGGCGTTAC CCAACTTAAT 9540 
CGCCTTGCAG CACATCCCCC TTTCGCCAGC TGGCGTAATA GCGAAGAGGC CCGCACCGAT 9600 
1280 CGCCCTTCCC AACAGTTGCG CAG CCTGAAT GGCGAATGNN NNNNNAATTC AGTACATTAA 966 0 
AAACGTCCGC AATGTGTTAT TAAGTTGTCT AAGCGTCAAT TTGTTTACAC CACAATATAT 972 0 
CCTGCCACCA GCCAGCCAAC AGCTCCCCGA CCGGCAGCTC GGCACAAAAT CACCACTCGA 978 0 

1285 

TACAGGCAGC CCATCAGNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 984 0 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 9900 
.1290 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 9960 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 10020 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 10080 

1295 

NNNNNNNNNN NNNNNNNNNN 10100 

<210> 6 
<211> 10240 
<212> DNA

<213> Brome mosaic virus 

<400> 6 

AAACACTGAT AGTTTAAACT GAAGGCGGGA AACGACAATC TGATCATGAG CGGAGAATTA 60
AGGGAGTCAC GTTATGACCC CCGCCGATGA CGCGGGACAA GCCGTTTTAC GTTTGGAACT 120
GACAGAACCG CAACGATTGA AGGAGCCACT CAGCCGCGGG TTTCTGGAGT TTAATGAGCT 180
AAGCACATAC GTCAGAAACC ATTATTGCGC GTTCAAAAGT CGCCTAAGGT CACTATCAGC 240
TAGCAAATAT TTCTTGTCAA AAATGCTCCA CTGACGTTCC ATAAATTCCC CTCGGTATCC 300
AATTAGNNNN NNNNNNNNNN NNNNNNNNNN GATCGTTTCG CATGATTGAA CAAGATGGAT 360
TGCACGCAGG TTCTCCGGCC GCTTGGGTGG AGAGGCTATT CGGCTATGAC TGGGCACAAC 420
AGACAATCGG CTGCTCTGAT GCCGCCGTGT TCCGGCTGTC AGCGCAGGGG CGCCCGGTTC 480
TTTTTGTCAA GACCGACCTG TCCGGTGCCC TGAATGAACT GCAGGACGAG GCAGCGCGGC 540
TATCGTGGCT GGCCACGACG GGCGTTCCTT GCGCAGCTGT GCTCGACGTT GTCACTGAAG 600
CGGGAAGGGA CTGGCTGCTA TTGGGCGAAG TGCCGGGGCA GGATCTCCTG TCATCTCACC 660
TTGCTCCTGC CGAGAAAGTA TCCATCATGG CTGATGCAAT GCGGCGGCTG CATACGCTTG 720
ATCCGGCTAC CTGCCCATTC GACCACCAAG CGAAACATCG CATCGAGCGA GCACGTACTC 780
GGATGGAAGC CGGTCTTGTC GATCAGGATG ATCTGGACGA AGAGCATCAG GGGCTCGCGC 840
CAGCCGAACT GTTCGCCAGG CTCAAGGCGC GCATGCCCGA CGGCGATGAT CTCGTCGTGA 900
CCCATGGCGA TGCCTGCTTG CCGAATATCA TGGTGGAAAA TGGCCGCTTT TCTGGATTCA 960
TCGACTGTGG CCGGCTGGGT GTGGCGGACC GCTATCAGGA CATAGCGTTG GCTACCCGTG 1020
ATATTGCTGA AGAGCTTGGC GGCGAATGGG CTGACCGCTT CCTCGTGCTT TACGGTATCG 1080
CCGCTCCCGA TTCGCAGCGC ATCGCCTTCT ATCGCCTTCT TGACGAGTTC TTCTGANNNN 1140
NNNNNNNNNN NNNNNNNNNN GATCGTTCAA ACATTTGGCA ATAAAGTTTC TTAAGATTGA 1200
ATCCTGTTGC CGGTCTTGCG ATGATTATCA TATAATTTCT GTTGAATTAC GTTAAGCATG 1260
TAATAATTAA CATGTAATGC ATGACGTTAT TTATGAGATG GGTTTTTATG ATTAGAGTCC 1320
CGCAATTATA CATTTAATAC GCGATAGAAA ACAAAATATA GCGCGCAAAC TAGGATAAAT 1380
TATCGCGCGC GGTGTCATCT ATGTTACTAG ATCGGGCCTC CTGTCAATGC TGGCGGCGGC 1440
TCTGGTGGTG GTTCTGGTGG CGGCTCTGAG GGTGGTGGCT CTGAGGGTGG CGGTTCTGAG 1500
GGTGGCGGCT CTGAGGGAGG CGGTTCCGGT GGTGGCTCTG GTTCCGGTGA TTTTGATTAT 1560
GAAAAGATGG CAAACGCTAA TAAGGGGGCT ATGACCGAAA ATGCCGATGA AAACGCGCTA 1620
CAGTCTGACG CTAAAGGCAA ACTTGATTCT GTCGCTACTG ATTACGGTGC TGCTATCGAT 1680
GGTTTCATTG GTGACGTTTC CGGCCTTGCT AATGGTAATG GTGCTACTGG TGATTTTGCT 1740



WO 99/61597 

GGCTCTAATT CCCAAATGGC TCAAGTCGGT GACGGTGATA ATTCACCTTT AATGAATAAT 1800
TTCCGTCAAT ATTTACCTTC CCTCCCTCAA TCGGTTGAAT GTCGCCCTTT TGTCTTTGGC 1860
CCAATACGCA AACCGCCTCT CCCCGCGCGT TGGCCGATTC ATTAATGCAG CTGGCACGAC 1920
AGGTTTCCCG ACTGGAAAGC GGGCAGTGAG CGCAACGCAA TTAATGTGAG TTAGCTCACT 1980
CATTAGGCAC CCCAGGCTTT ACACTTTATG CTTCCGGCTC GTATGTTGTG TGGAATTGTG 2040
AGCGGATAAC AATTTCACAC AGGAAACAGC TATGACCATG ATTACGCCAA GCTTGCATGC 2100
CTGCAGGTCA CTGGATTTTG GTTTTAGGAA TTAGAAATTT TATTGATAGA AGTATTTTAC 2160
AAATACAAAT ACATACTAAG GGTTTCTTAT ATGCTCAACA CATGAGCGAA ACCCTATAAG 2220
AACCCTAATT CCCTTATCTG GGAACTACTC ACACATTATT CTGGAGAAAA TAGAGAGAGA 2280
TAGATTTGTA GAGAGAGACT GGTGATTTGC GGACTCTAGA GGATCCCCAG CTTTTAAACT 2340
TAGCCAAAGT GGTCTGCCTG ACCAGGAGTT TTTAACCTTA ACCAAAGGGC TGTTCACAGC 2400
TTAGGTTCAT ATATCATAGA ACCGATCATC TCAGATCAGA GGGCTTAAAA GTCTCACAAT 2460
GGGACTTCAC GAGCAAAGCA TCAACTGACG TTAGGCCTCC TCTACCGGTA GCGTAATCGT 2520
CGACCTTCTT TTTCAAGCGT TGTGTGGTCC TACGATCATT AGCTAATTTG AGTGACTCAC 2580
GCTCAAGGGC CTCATGTAAA CGTCCGATCC GTTTGACAGG GAGCTCCTTA GTACTACAGT 2640
CCGAGGAATA AATTCCAATG GTTCTGTAGA CTTTGTCTAA CACACCAGGA AACTTTGGAT 2700
TCTTCCAGTT GTGAAACCAG TCACCATCAG TTTTACGCTC TTCCGTGGTG CGTTTGAACT 2760
TACATACAGG ATCGCTCATC TGATAAACTC TGATGCCTTC GGTACAGTAG CAATCAGAGA 2820
ACCTCAGGAA ATTCTCGGAG TATAAAGAAA AAGCCGCAAG AGCAGCTCTA ACCTCCTCGA 2880
AAATCCAAGG TTTTTCTTTC CCATATTTCA GATAAACAAA ATGACAGAGC GTCGTAATCA 2940
TCTTCTCATC AAGTTGATTA ATAAACTTCA TTCGATCACA GAAGGAAACG AAATGTGCTC 3000
TGAGCATCTG TTCATCACGC AGAATCTTTC GCTTAGCTAA GCGCTGGATC TCTCTCAGAG 3060
GATCTGGTAC AGACACCAAA TTGCCCATTT CAGTTTCGAC GAGAAACTTA CTACAAACGT 3120
AGGGCACACT AGGGTCCATG ACTTTTATCT CCATATTGAA GAGAGACGTA AACATATCGG 3180
TATCCAGGAC TGGCTTAACT TTAGAGATGA TTAAAGAATC ATCTCCTGAA AATATTGCAC 3240
AGTCACAGTC ACTTAGATCA GAGGCATATG CAATCATAGC CATAGTGACA AGAGTATTAC 3300
CGAAATATGT AAACGCGTCA CCAGTTCTGC GTTGGAAGGA AACGGACATT CCCACCTTGG 3360
CATGAGGGTC TGATAAATAA GAATCGCGAT GAAAATCAGA CCACCAATTC GTCAGCGGCG 3420
CTGGAAAGCC CAGCGCAAGG AGTATCTCTC TCTGAAACTC TAGGTGCAGC TCACCCTGAG 3480
ATTTATCAAA TTTGCTTAGG TCCGCTTCAA GAAAGTATCT GTTATTCAAG CGGACATTCT 3540
TAAGCTCCAG AGAGGATATC TTTCCGATAG GCACAATGAA CCTGGATTTC AGGGCCAGTG 3600
ATAACTTCTC GAAACAAGCA GTGAAAAAGG GTGAAAAATT ACTAGTCACA CCTTTACTAT 3660

GAAATGTTAT AGTAGCTGCT ACTGCTCGTT CCAAGTGAAG GGTGTCAGTT ACAACAGGTT 3720
TTACGTCAGA CTTCAGCATA TGCTGGTACC GACATAAATC AGTCTCTGCT GCCACATTCA 3780
CACCTTGCAA GTCCATGTGC TTACCCCACT TCTTATGGTA CTCAAGACAT TTAGTCATGA 3840
CATCCATAGA AGCTCTCAGA CAGTCTTCAC CGTCAACATT AAGGAATGTG CTACGAAAGC 3900
GCTTTGCTAT AGCTTTCGCA GTGTCCTTCA TGTTAATCGC GTCTCCCATT TCTGGAACGT 3960
CCGCGTTTCG CTTTTTGAGT GCGGTTAAGA CTTCTTTCTG AGTACCAACT CTTCGCTGAG 4020
CACTCCCGAT ATTCATTTTT GGTTGAAAAT ATTTATCGGG GTCCCTATAC CAGTCTACAT 4080
CACTTTGCTT AAGTCTGATC CTATCAAAGT CCATGGAATA ATCACCATTT TCAACAAGGG 4140
CTTGATGGTA CGAATCATCG AAATAAGCAT GGGTTGGCAG TATGGAATGA CTGGTCGCTT 4200
CTGTTCTAGC AAGGCTGACT CTCTCCATAT AAATTGGCCC AGTAGAGATG TCAGGGTTAT 4260
CTGGATGGCA GTGTGTATCA ATAACACGCG AAACCCTATG TTCAATAGGG TTCATGATTT 4320
GAAGAGTGAT GTCGTAATCA GTATTAGTAG TCTGAAACTC TTCATCAATG CCCATGTACC 4380
TATCTCCAAG GGTCAGCTCC TTGGGGGTAT CTCCAGTAAC ACGAACTTCC TCAATTTCAC 4440
AGTTCGAGGA ATCACTGGCG AGTTTTAGAT CGCTCGCATG ATCTTCATCG GCGGCAAACG 4500
ATACACCGTA ACCATCACTA GTATCCTCGG GATACCAGTC ATCAATTTCA TCTTCGAGCA 4560
CGAAAGAGCC CGGAATGTCA AGATATAACA TCCGTGCCAT TTCAGCTTGA GGAATCAGCG 4620
GTCTATCGGT GAACTGTTGA ACCATTTGTT GGACGGTGTC GCAAATAGAG CCCCAGCGCA 4680
CTCGGTCAAA AGGGGGATCG AATACCCCTC CTATCTCCAA GGGCGCTATA GCTAATTTAA 4740
AACTCGCGAG AGATCCGTCA ATGGCAACTC CGTCTGCCGG CTCCTGCACC TGAAGGCTAG 4800
CAGCCTCCAC CTCGTCTTCT AAGGATTGAT CTATGATCCA TTGGAAAGAC GGGACCTGGC 4860
GAACGAAATC ATCATCCCAG GTTTTCGAAG ACATCTTGGT GATAGTAGAA AGAACAAGCA 4920
CACAACAACA ACAAGGTCAG ATGTGTGTTG CGGGTACCGA GCTCGAATTC TCGAGGTCCT 4980
CTCCAAATGA AATGAACTTC CTTATATAGA GGAAGGGTCT TGCGAAGGAT AGTGGGATTG 5040
TGCGTCATCC CTTACGTCAG TGGAGATATC ACATCAATCC ACTTGCTTTG AAGACGTGGT 5100
TGGAACGTCT TCTTTTTCCA CGATGTTCCT CGTGGGTGGG GGTCCATCTT TGGGACCACT 5160
GTCGGTAGAG GCATTCTTGA ACGATAGCCT TTCCTTTATC GCAATGATGG CATTTGTAGA 5220
AGCCATCTTC CTTTTCTACT GTCCTTTCGA TGAAGTGACA GATAGCTGGG CAATGGAATC 5280
CGAGGAGGTT TCCCGATATT ACCCTTTGTT GAAAAGTCTC AATAGCCCTC TGGTCTTCTG 5340
AGACTGTATC TTTGATATTC TTGGAGTAGA CGAGAGTGTC GTGCTCCACC ATGTTGACCT 5400
GCAGGCAGCA AGCTTGCATG CCTGCAGGTC GACTCTAGAG GATCCCCGGT CACTGGATTT 5460
TGGTTTTAGG AATTAGAAAT TTTATTGATA GAAGTATTTT ACAAATACAA ATACATACTA 5520
AGGGTTTCTT ATATGCTCAA CACATGAGCG AAACCCTATA AGAACCCTAA TTCCCTTATC 5580
TGGGAACTAC TCACACATTA TTCTGGAGAA AATAGAGAGA GATAGATTTG TAGAGAGAGA 5640
CTGGTGATTT GCGGACTCTA GAGGATCCCC GGGTACCGAG CTCGAATTCT CGAGCAGAGG 5700
TCTCACACAG AGACAAGCGC ATCACTTAAC ACAATTAAAG ATCAAATCAC CAGCGAGCTC 5760
GCCGTTAAAG CAATACTCAA AGGACTTCTT GTGTCGTGTT AAGGCAACCA AACAGTACTC 5820
CTCATGTTTA AACAAATCAC ATTTGGTCGA CTTAAGCCGA ACCAAAGTGA CGTTGTCAAC 5880
AGAGATCCCT TGCGCTTCGT GTACTGTTTT TATGTGTCCA TCAATCCAGT CCTTGCTCAC 5940
GGGAAAATCC TTAGCCCTCG TTTGAAGGGC CGCTTTATCA GCTTGAGTCA TCGTAAGATA 6000
CGTTCTGTTC GGATCAATAG TGACCTGCAA ACCAGAAGTA ATACGACGCT TCGTGAGACT 6060
TCTAGAAACT TTGGACTCAG ATGTCCAGGA TTGATACTTC GTGTCCCTAT TACCGCATTT 6120
ACGCTTCAGC AGATTAACAG CAGCGATAAC ATCTTGCGGA CACCGGTAAG TCTTGTGAAC 6180
AACGTCACGG CGATCATATT GCAGATTACC GTGGAGCAAT TTAAAACCCG CGTCACGAGA 6240
CTTGAACGAA ATCTGCTCTG TGTCCCCAAA GGCAAGAACT TGTGAACATT TAGACAGAGC 6300
AGCCACCACC AGGAGTTGAC CATAATGTAG TAAACCAGCC TCATCAACAA GCAGCCTATG 6360
ACAGGACGGT ACACCGTGCA TGATCGCAGA ATCCGCGGTG CGCACAACGT CCAAAGCTAC 6420
CTTGGAATTA TAAGTGTCAG GGAATAAAGC CATCCTGACG TCCTCGGCCG ATTTACGATT 6480
CGCCGTCACA ATTAGGTCCT CTCCCATACG GAATGCATCT TTTATGGCAG TGGTTTTACC 6540
GCATCCCGCA ACTCCATCAA CCATGGAAAT ATCGCATGTA GGGACAGAAA CTTTGGCGCT 6600
AGCTTCTGCA ATGTCCCTCA AGTTAGAGCA TGCACATGTT TTATCAACAA TGTACGTTTC 6660
ATCTGCGTGC TTCGGACCTA AACCATGCTC ATTATATCCA ACAGTGTAAT CGTATTTTTT 6720
AGGATACAAC CAGTTACCGT TGGCCAAATG GACATTCACC ATATCGTCTA TGCGATGGTA 6780
GGTCTCAAAG ATGCTCTTAT TTGCGATCTC ACTTCCGCGA CCGCCGGAAA TGTCCCATAG 6840
GTGACGAAGA TTAGACTCGG AGTTGTTATG TAATCTCTTA CAATAACGCA CAAATTCCTT 6900
CATGGCTCCG TGTCTAGATA TGCCACGAGG GTCCGTTGGT ACCTCAACAG ACACCTCGGC 6960
ATCCGGGACC ACATCAGTCA CCGGTTTAAC GTCATCACTG ACGGACTCAG GGCTCGAACT 7020
CTCAGGGGCA TCATGAAACT CCTCCTGAGG TATCTCAGCA GCTGGCGGGA CTTTCGCCTT 7080
CTTCTTCGAG CGCTTGGTCT TGGCTGTCTG CACTTCATGC TCCAGCCGGT CGAATAAGTC 7140
CTCTTCAGTC CAAAACGTTC TCAAACGTGA TATCGGTACA GAATCTTGCT CAAATTCTTC 7200
AACGTTTGAG AGACGAGTCA GAAACTTAAA ACTGTCCGCA TAAGAATCCA GACGTAGTAG 7260
GGGAAATCTG CTAGCCAATG TTCTCAGCCA TCCTACTTTC GCCCTGGATG AATCTCCACC 7320
CCACCAAAAC CTAGTTTTGA AGTGATGGCA CCAACCTTTC CATTCCATCC CATCGCGGAG 7380
GGCCGTAAGC TTTTCGTACT TTTGATACAG ATTCAAAGTC AAAGCAAAGG CCACTAGATG 7440
ATAATCTTCA ATGTCTAAGC GCTCACCAGC CATGATAGCC TGACCGTTAA TAATAACAGT 7500
CGACGACTTG GCGGATAAGA TAGATGCGAC AGCTTTCATG TTCTCAGTCC ATTCTTTACT 7560
TTCCTTGAAA CATCTGAAAG CTATCTCCTC TACCTCTCTC ACTGTGGTTT TGGCGACGCG 7620
CACACATTTC CAGCGATTGA GACTCCAGTC TTCAGGTATT GAGACCCCTA CGTACTTAGA 7680
TATGTCTTCA AACCATACAC AGTGACGTAG TGTCTCCCGG GGGCAGCGTA AATTTGTAGC 7740
GATGATCTTA TAGGTCATGA TGTTACATTT CAGCATTTCG CGCTCCAACA GATAGGTGGT 7800
TCCATCGATG CAATGCACCG ACTCGGTGAA AAATGAGCCC AAATCTTGCC ATCCGTGGAT 7860
GTAAGATAAT GTGCTTTCAT TTTCAAAATC GAATTTGATC ACCTCATCCG CGCCTGACCC 7920
GTCACGTTGC CAGTGACATT TAAGCAAGGG AAGAAAACCC TCGCGGTCAA ACAACATGGC 7980
GCCGTCGAAC ATAACGGTAC CACGTAGTAC GCGTACTCCA TGCGAATGCA TGGCGTCACA 8040
CAGACCTTGG AAGCCCATAT CATAACCGCC GTGGATACAG ATAGCCCAAT CAGCTTGGAC 8100
ATCACAATCT TGAGCTCGGT TAAGACAAAA GTTCGGGACT TCATCGAAAT CATCGCTTTC 8160
TTGCAAAATT TTTCGCATGC GGCACATCCT CTCCTCATGT CGGGCAGCGT CTCTAACACC 8220
CAACACAGGA CAACAACTGT GCACCCTTTT ATCCCTTCTT GAAAAGTGAT GCCACCAAGA 8280
CCCTCCGAAA TCTATAACGG GGTCTTCAGG GGGAAAACTG TCGAGACAGT CATAATGCTC 8340
CGCTACACGC AGAGCACCAG CCAGGCTATG GGGCGCATGA TACTGCTGAG TCAAATTTAA 8400
GTCAAAGGCA CCACCATAAC GGTCACGGAA GGCGTCAGCC TCCTCAATAG AGAGCTTATT 8460
GCGAACGTTG ATTTTCTTAG ACCTTTTCGC GTATTCAATC TGCGCAGATA ACTGTTGCGC 8520
AACCTGATTG TCTACGATGT CTTGGGCACT CTGGCTGTCA GCACCCTTCT CAGCAATCAA 8580
CTTCAGCAAA TCGATAGAAC TTGACATTTT GTTGGTGAAA AACAAAGAAC AAGTAGCAGA 8640
ACCGTGGTCG AGGTCCTCTC CAAATGAAAT GAACTTCCTT ATATAGAGGA AGGGTCTTGC 8700
GAAGGATAGT GGGATTGTGC GTCATCCCTT ACGTCAGTGG AGATATCACA TCAATCCACT 8760
TGCTTTGAAG ACGTGGTTGG AACGTCTTCT TTTTCCACGA TGTTCCTCGT GGGTGGGGGT 8820
CCATCTTTGG GACCACTGTC GGTAGAGGCA TTCTTGAACG ATAGCCTTTC CTTTATCGCA 8880
ATGATGGCAT TTGTAGAAGC CATCTTCCTT TTCTACTGTC CTTTCGATGA AGTGACAGAT 8940
AGCTGGGCAA TGGAATCCGA GGAGGTTTCC CGATATTACC CTTTGTTGAA AAGTCTCAAT 9000
AGCCCTCTGG TCTTCTGAGA CTGTATCTTT GATATTCTTG GAGTAGACGA GAGTGTCGTG 9060
CTCCACCATG TTGACCGGGT GGTCAGTCCC TTATGTTACG TCCTGTAGAA ACCCCAACCC 9120
GTGAAATCAA AAAACTCGAC GGCCTGTGGG CATTCAGTCT GGATCGCGAA AACTGTGGAA 9180
TTGATCAGCG TTGGTGGGAA AGCGCGTTAC AAGAAAGCCG GGCAATTGCT GTGCCAGGCA 9240
GTTTTAACGA TCAGTTCGCC GATGCAGATA TTCGTAATTA TGCGGGCAAC GTCTGGTATC 9300
AGCGCGAAGT CTTTATACCG AAAGGTTGGG CAGGCCAGCG TATCGTGCTG CGTTTCGATG 9360
CGGTCACTCA TTACGGCAAA GTGTGGGTCA ATAATCAGGA AGTGATGGAG CATCAGGGCG 9420
GCTATACGCC ATTTGAAGCC GATGTCACGC CGTATGTTAT TGCCGGGAAA AGTGTACAAT 9480
TCACTGGCCG TCGTTTTACA ACGTCGTGAC TGGGAAAACC CTGGCGTTAC CCAACTTAAT 9540
CGCCTTGCAG CACATCCCCC TTTCGCCAGC TGGCGTAATA GCGAAGAGGC CCGCACCGAT 9600
CGCCCTTCCC AACAGTTGCG CAGCCTGAAT GGCGAATGNN NNNNNAATTC AGTACATTAA 9660
AAACGTCCGC AATGTGTTAT TAAGTTGTCT AAGCGTCAAT TTGTTTACAC CACAATATAT 9720
CCTGCCACCA GCCAGCCAAC AGCTCCCCGA CCGGCAGCTC GGCACAAAAT CACCACTCGA 9780
TACAGGCAGC CCATCAGNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 9840
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 9900
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 9960
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 10020
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 10080
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 10140
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 10200
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 10240

<210> 7 
<211> 10272 
<212> DNA 

<213> Brome mosaic virus 

<400> 7 

AAACACTGAT AGTTTAAACT GAAGGCGGGA AACGACAATC TGATCATGAG CGGAGAATTA 60
AGGGAGTCAC GTTATGACCC CCGCCGATGA CGCGGGACAA GCCGTTTTAC GTTTGGAACT 120
GACAGAACCG CAACGATTGA AGGAGCCACT CAGCCGCGGG TTTCTGGAGT TTAATGAGCT 180
AAGCACATAC GTCAGAAACC ATTATTGCGC GTTCAAAAGT CGCCTAAGGT CACTATCAGC 240
TAGCAAATAT TTCTTGTCAA AAATGCTCCA CTGACGTTCC ATAAATTCCC CTCGGTATCC 300
AATTAGNNNN NNNNNNNNNN NNNNNNNNNN GATCGTTTCG CATGATTGAA CAAGATGGAT 360
TGCACGCAGG TTCTCCGGCC GCTTGGGTGG AGAGGCTATT CGGCTATGAC TGGGCACAAC 420
AGACAATCGG CTGCTCTGAT GCCGCCGTGT TCCGGCTGTC AGCGCAGGGG CGCCCGGTTC 480
TTTTTGTCAA GACCGACCTG TCCGGTGCCC TGAATGAACT GCAGGACGAG GCAGCGCGGC 540
TATCGTGGCT GGCCACGACG GGCGTTCCTT GCGCAGCTGT GCTCGACGTT GTCACTGAAG 600
CGGGAAGGGA CTGGCTGCTA TTGGGCGAAG TGCCGGGGCA GGATCTCCTG TCATCTCACC 660
TTGCTCCTGC CGAGAAAGTA TCCATCATGG CTGATGCAAT GCGGCGGCTG CATACGCTTG 720
ATCCGGCTAC CTGCCCATTC GACCACCAAG CGAAACATCG CATCGAGCGA GCACGTACTC 780
GGATGGAAGC CGGTCTTGTC GATCAGGATG ATCTGGACGA AGAGCATCAG GGGCTCGCGC 840
CAGCCGAACT GTTCGCCAGG CTCAAGGCGC GCATGCCCGA CGGCGATGAT CTCGTCGTGA 900
CCCATGGCGA TGCCTGCTTG CCGAATATCA TGGTGGAAAA TGGCCGCTTT TCTGGATTCA 960
TCGACTGTGG CCGGCTGGGT GTGGCGGACC GCTATCAGGA CATAGCGTTG GCTACCCGTG 1020
ATATTGCTGA AGAGCTTGGC GGCGAATGGG CTGACCGCTT CCTCGTGCTT TACGGTATCG 1080
CCGCTCCCGA TTCGCAGCGC ATCGCCTTCT ATCGCCTTCT TGACGAGTTC TTCTGANNNN 1140
NNNNNNNNNN NNNNNNNNNN GATCGTTCAA ACATTTGGCA ATAAAGTTTC TTAAGATTGA 1200
ATCCTGTTGC CGGTCTTGCG ATGATTATCA TATAATTTCT GTTGAATTAC GTTAAGCATG 1260
TAATAATTAA CATGTAATGC ATGACGTTAT TTATGAGATG GGTTTTTATG ATTAGAGTCC 1320
CGCAATTATA CATTTAATAC GCGATAGAAA ACAAAATATA GCGCGCAAAC TAGGATAAAT 1380
TATCGCGCGC GGTGTCATCT ATGTTACTAG ATCGGGCCTC CTGTCAATGC TGGCGGCGGC 1440
TCTGGTGGTG GTTCTGGTGG CGGCTCTGAG GGTGGTGGCT CTGAGGGTGG CGGTTCTGAG 1500
GGTGGCGGCT CTGAGGGAGG CGGTTCCGGT GGTGGCTCTG GTTCCGGTGA TTTTGATTAT 1560
GAAAAGATGG CAAACGCTAA TAAGGGGGCT ATGACCGAAA ATGCCGATGA AAACGCGCTA 1620
CAGTCTGACG CTAAAGGCAA ACTTGATTCT GTCGCTACTG ATTACGGTGC TGCTATCGAT 1680
GGTTTCATTG GTGACGTTTC CGGCCTTGCT AATGGTAATG GTGCTACTGG TGATTTTGCT 1740
GGCTCTAATT CCCAAATGGC TCAAGTCGGT GACGGTGATA ATTCACCTTT AATGAATAAT 1800
TTCCGTCAAT ATTTACCTTC CCTCCCTCAA TCGGTTGAAT GTCGCCCTTT TGTCTTTGGC 1860
CCAATACGCA AACCGCCTCT CCCCGCGCGT TGGCCGATTC ATTAATGCAG CTGGCACGAC 1920
AGGTTTCCCG ACTGGAAAGC GGGCAGTGAG CGCAACGCAA TTAATGTGAG TTAGCTCACT 1980
CATTAGGCAC CCCAGGCTTT ACACTTTATG CTTCCGGCTC GTATGTTGTG TGGAATTGTG 2040
AGCGGATAAC AATTTCACAC AGGAAACAGC TATGACCATG ATTACGCCAA GCTTGCTGCC 2100
TGCAGGTCAA CATGGTGGAG CACGACACTC TCGTCTACTC CAAGAATATC AAAGATACAG 2160
TCTCAGAAGA CCAGAGGGCT ATTGAGACTT TTCAACAAAG GGTAATATCG GGAAACCTCC 2220
TCGGATTCCA TTGCCCAGCT ATCTGTCACT TCATCGAAAG GACAGTAGAA AAGGAAGATG 2280
GCTTCTACAA ATGCCATCAT TGCGATAAAG GAAAGGCTAT CGTTCAAGAA TGCCTCTACC 2340
GACAGTGGTC CCAAAGATGG ACCCCCACCC ACGAGGAACA TCGTGGAAAA AGAAGACGTT 2400
CCAACCACGT CTTCAAAGCA AGTGGATTGA TGTGATATCT CCACTGACGT AAGGGATGAC 2460
GCACAATCCC ACTATCCTTC GCAAGACCCT TCCTCTATAT AAGGAAGTTC ATTTCATTTG 2520
GAGAGGACCT CGAGAATTCG AGCTCGGTAC CCGCAACACA CATCTGACCT TGTTGTTGTT 2580
GTGTGCTTGT TCTTTCTACT ATCACCAAGA TGTCTTCGAA AACCTGGGAT GATGATTTCG 2640
TTCGCCAGGT CCCGTCTTTC CAATGGATCA TAGATCAATC CTTAGAAGAC GAGGTGGAGG 2700
CTGCTAGCCT TCAGGTGCAG GAGCCGGCAG ACGGAGTTGC CATTGACGGA TCTCTCGCGA 2760
GTTTTAAATT AGCTATAGCG CCCTTGGAGA TAGGAGGGGT ATTCGATCCC CCTTTTGACC 2820
GAGTGCGCTG GGGCTCTATT TGCGACACCG TCCAACAAAT GGTTCAACAG TTCACCGATA 2880
GACCGCTGAT TCCTCAAGCT GAAATGGCAC GGATGTTATA TCTTGACATT CCGGGCTCTT 2940
TCGTGCTCGA AGATGAAATT GATGACTGGT ATCCCGAGGA TACTAGTGAT GGTTACGGTG 3000
TATCGTTTGC CGCCGATGAA GATCATGCGA GCGATCTAAA ACTCGCCAGT GATTCCTCGA 3060
ACTGTGAAAT TGAGGAAGTT CGTGTTACTG GAGATACCCC CAAGGAGCTG ACCCTTGGAG 3120
ATAGGTACAT GGGCATTGAT GAAGAGTTTC AGACTACTAA TACTGATTAC GACATCACTC 3180
TTCAAATCAT GAACCCTATT GAACATAGGG TTTCGCGTGT TATTGATACA CACTGCCATC 3240
CAGATAACCC TGACATCTCT ACTGGGCCAA TTTATATGGA GAGAGTCAGC CTTGCTAGAA 3300
CAGAAGCGAC CAGTCATTCC ATACTGCCAA CCCATGCTTA TTTCGATGAT TCGTACCATC 3360
AAGCCCTTGT TGAAAATGGT GATTATTCCA TGGACTTTGA TAGGATCAGA CTTAAGCAAA 3420
GTGATGTAGA CTGGTATAGG GACCCCGATA AATATTTTCA ACCAAAAATG AATATCGGGA 3480
GTGCTCAGCG AAGAGTTGGT ACTCAGAAAG AAGTCTTAAC CGCACTCAAA AAGCGAAACG 3540
CGGACGTTCC AGAAATGGGA GACGCGATTA ACATGAAGGA CACTGCGAAA GCTATAGCAA 3600
AGCGCTTTCG TAGCACATTC CTTAATGTTG ACGGTGAAGA CTGTCTGAGA GCTTCTATGG 3660
ATGTCATGAC TAAATGTCTT GAGTACCATA AGAAGTGGGG TAAGCACATG GACTTGCAAG 3720
GTGTGAATGT GGCAGCAGAG ACTGATTTAT GTCGGTACCA GCATATGCTG AAGTCTGACG 3780
TAAAACCTGT TGTAACTGAC ACCCTTCACT TGGAACGAGC AGTAGCAGCT ACTATAACAT 3840
TTCATAGTAA AGGTGTGACT AGTAATTTTT CACCCTTTTT CACTGCTTGT TTCGAGAAGT 3900
TATCACTGGC CCTGAAATCC AGGTTCATTG TGCCTATCGG AAAGATATCC TCTCTGGAGC 3960
TTAAGAATGT CCGCTTGAAT AACAGATACT TTCTTGAAGC GGACCTAAGC AAATTTGATA 4020
AATCTCAGGG TGAGCTGCAC CTAGAGTTTC AGAGAGAGAT ACTCCTTGCG CTGGGCTTTC 4080
CAGCGCCGCT GACGAATTGG TGGTCTGATT TTCATCGCGA TTCTTATTTA TCAGACCCTC 4140
ATGCCAAGGT GGGAATGTCC GTTTCCTTCC AACGCAGAAC TGGTGACGCG TTTACATATT 4200
TCGGTAATAC TCTTGTCACT ATGGCTATGA TTGCATATGC CTCTGATCTA AGTGACTGTG 4260
ACTGTGCAAT ATTTTCAGGA GATGATTCTT TAATCATCTC TAAAGTTAAG CCAGTCCTGG 4320
ATACCGATAT GTTTACGTCT CTCTTCAATA TGGAGATAAA AGTCATGGAC CCTAGTGTGC 4380
CCTACGTTTG TAGTAAGTTT CTCGTCGAAA CTGAAATGGG CAATTTGGTG TCTGTACCAG 4440
ATCCTCTGAG AGAGATCCAG CGCTTAGCTA AGCGAAAGAT TCTGCGTGAT GAACAGATGC 4500
TCAGAGCACA TTTCGTTTCC TTCTGTGATC GAATGAAGTT TATTAATCAA CTTGATGAGA 4560
AGATGATTAC GACGCTCTGT CATTTTGTTT ATCTGAAATA TGGGAAAGAA AAACCTTGGA 4620
TTTTCGAGGA GGTTAGAGCT GCTCTTGCGG CTTTTTCTTT ATACTCCGAG AATTTCCTGA 4680
GGTTCTCTGA TTGCTACTGT ACCGAAGGCA TCAGAGTTTA TCAGATGAGC GATCCTGTAT 4740
GTAAGTTCAA ACGCACCACG GAAGAGCGTA AAACTGATGG TGACTGGTTT CACAACTGGA 4800
AGAATCCAAA GTTTCCTGGT GTGTTAGACA AAGTCTACAG AACCATTGGA ATTTATTCCT 4860
CGGACTGTAG TACTAAGGAG CTCCCTGTCA AACGGATCGG ACGTTTACAT GAGGCCCTTG 4920
AGCGTGAGTC ACTCAAATTA GCTAATGATC GTAGGACCAC ACAACGCTTG AAAAAGAAGG 4980
TCGACGATTA CGCTACCGGT AGAGGAGGCC TAACGTCAGT TGATGCTTTG CTCGTGAAGT 5040
CCCATTGTGA GACTTTTAAG CCCTCTGATC TGAGATGATC GGTTCTATGA TATATGAACC 5100
TAAGCTGTGA ACAGCCCTTT GGTTAAGGTT AAAAACTCCT GGTCAGGCAG ACCACTTTGG 5160
CTAAGTTTAA AAGCTGGGGA TCCTCTAGAG TCCGCAAATC ACCAGTCTCT CTCTACAAAT 5220
CTATCTCTCT CTATTTTCTC CAGAATAATG TGTGAGTAGT TCCCAGATAA GGGAATTAGG 5280
GTTCTTATAG GGTTTCGCTC ATGTGTTGAG CATATAAGAA ACCCTTAGTA TGTATTTGTA 5340
TTTGTAAAAT ACTTCTATCA ATAAAATTTC TAATTCCTAA AACCAAAATC CAGTGACCTG 5400
CAGGCATGCA AGCTTGCATG CCTGCAGGTC GACTCTAGAG GATCCCCGGT CAACATGGTG 5460
GAGCACGACA CTCTCGTCTA CTCCAAGAAT ATCAAAGATA CAGTCTCAGA AGACCAGAGG 5520
GCTATTGAGA CTTTTCAACA AAGGGTAATA TCGGGAAACC TCCTCGGATT CCATTGCCCA 5580
GCTATCTGTC ACTTCATCGA AAGGACAGTA GAAAAGGAAG ATGGCTTCTA CAAATGCCAT 5640
CATTGCGATA AAGGAAAGGC TATCGTTCAA GAATGCCTCT ACCGACAGTG GTCCCAAAGA 5700
TGGACCCCCA CCCACGAGGA ACATCGTGGA AAAAGAAGAC GTTCCAACCA CGTCTTCAAA 5760
GCAAGTGGAT TGATGTGATA TCTCCACTGA CGTAAGGGAT GACGCACAAT CCCACTATCC 5820
TTCGCAAGAC CCTTCCTCTA TATAAGGAAG TTCATTTCAT TTGGAGAGGA CCTCGACCAC 5880
GGTTCTGCTA CTTGTTCTTT GTTTTTCACC AACAAAATGT CAAGTTCTAT CGATTTGCTG 5940
AAGTTGATTG CTGAGAAGGG TGCTGACAGC CAGAGTGCCC AAGACATCGT AGACAATCAG 6000
GTTGCGCAAC AGTTATCTGC GCAGATTGAA TACGCGAAAA GGTCTAAGAA AATCAACGTT 6060
CGCAATAAGC TCTCTATTGA GGAGGCTGAC GCCTTCCGTG ACCGTTATGG TGGTGCCTTT 6120
GACTTAAATT TGACTCAGCA GTATCATGCG CCCCATAGCC TGGCTGGTGC TCTGCGTGTA 6180
GCGGAGCATT ATGACTGTCT CGACAGTTTT CCCCCTGAAG ACCCCGTTAT AGATTTCGGA 6240
GGGTCTTGGT GGCATCACTT TTCAAGAAGG GATAAAAGGG TGCACAGTTG TTGTCCTGTG 6300
TTGGGTGTTA GAGACGCTGC CCGACATGAG GAGAGGATGT GCCGCATGCG AAAAATTTTG 6360
CAAGAAAGCG ATGATTTCGA TGAAGTCCCG AACTTTTGTC TTAACCGAGC TCAAGATTGT 6420
GATGTCCAAG CTGATTGGGC TATCTGTATC CACGGCGGTT ATGATATGGG CTTCCAAGGT 6480
CTGTGTGACG CCATGCATTC GCATGGAGTA CGCGTACTAC GTGGTACCGT TATGTTCGAC 6540
GGCGCCATGT TGTTTGACCG CGAGGGTTTT CTTCCCTTGC TTAAATGTCA CTGGCAACGT 6600
GACGGGTCAG GCGCGGATGA GGTGATCAAA TTCGATTTTG AAAATGAAAG CACATTATCT 6660
TACATCCACG GATGGCAAGA TTTGGGCTCA TTTTTCACCG AGTCGGTGCA TTGCATCGAT 6720
GGAACCACCT ATCTGTTGGA GCGCGAAATG CTGAAATGTA ACATCATGAC CTATAAGATC 6780
ATCGCTACAA ATTTACGCTG CCCCCGGGAG ACACTACGTC ACTGTGTATG GTTTGAAGAC 6840
ATATCTAAGT ACGTAGGGGT CTCAATACCT GAAGACTGGA GTCTCAATCG CTGGAAATGT 6900
GTGCGCGTCG CCAAAACCAC AGTGAGAGAG GTAGAGGAGA TAGCTTTCAG ATGTTTCAAG 6960
GAAAGTAAAG AATGGACTGA GAACATGAAA GCTGTCGCAT CTATCTTATC CGCCAAGTCG 7020
TCGACTGTTA TTATTAACGG TCAGGCTATC ATGGCTGGTG AGCGCTTAGA CATTGAAGAT 7080
TATCATCTAG TGGCCTTTGC TTTGACTTTG AATCTGTATC AAAAGTACGA AAAGCTTACG 7140
GCCCTCCGCG ATGGGATGGA ATGGAAAGGT TGGTGCCATC ACTTCAAAAC TAGGTTTTGG 7200
TGGGGTGGAG ATTCATCCAG GGCGAAAGTA GGATGGCTGA GAACATTGGC TAGCAGATTT 7260
CCCCTACTAC GTCTGGATTC TTATGCGGAC AGTTTTAAGT TTCTGACTCG TCTCTCAAAC 7320
GTTGAAGAAT TTGAGCAAGA TTCTGTACCG ATATCACGTT TGAGAACGTT TTGGACTGAA 7380
GAGGACTTAT TCGACCGGCT GGAGCATGAA GTGCAGACAG CCAAGACCAA GCGCTCGAAG 7440
AAGAAGGCGA AAGTCCCGCC AGCTGCTGAG ATACCTCAGG AGGAGTTTCA TGATGCCCCT 7500




GAGAGTTCGA GCCCTGAGTC CGTCAGTGAT GACGTTAAAC CGGTGACTGA TGTGGTCCCG 7560
GATGCCGAGG TGTCTGTTGA GGTACCAACG GACCCTCGTG GCATATCTAG ACACGGAGCC 7620
ATGAAGGAAT TTGTGCGTTA TTGTAAGAGA TTACATAACA ACTCCGAGTC TAATCTTCGT 7680
CACCTATGGG ACATTTCCGG CGGTCGCGGA AGTGAGATCG CAAATAAGAG CATCTTTGAG 7740
ACCTACCATC GCATAGACGA TATGGTGAAT GTCCATTTGG CCAACGGTAA CTGGTTGTAT 7800
CCTAAAAAAT ACGATTACAC TGTTGGATAT AATGAGCATG GTTTAGGTCC GAAGCACGCA 7860
GATGAAACGT ACATTGTTGA TAAAACATGT GCATGCTCTA ACTTGAGGGA CATTGCAGAA 7920
GCTAGCGCCA AAGTTTCTGT CCCTACATGC GATATTTCCA TGGTTGATGG AGTTGCGGGA 7980
TGCGGTAAAA CCACTGCCAT AAAAGATGCA TTCCGTATGG GAGAGGACCT AATTGTGACG 8040
GCGAATCGTA AATCGGCCGA GGACGTCAGG ATGGCTTTAT TCCCTGACAC TTATAATTCC 8100
AAGGTAGCTT TGGACGTTGT GCGCACCGCG GATTCTGCGA TCATGCACGG TGTACCGTCC 8160
TGTCATAGGC TGCTTGTTGA TGAGGCTGGT TTACTACATT ATGGTCAACT CCTGGTGGTG 8220
GCTGCTCTGT CTAAATGTTC ACAAGTTCTT GCCTTTGGGG ACACAGAGCA GATTTCGTTC 8280
AAGTCTCGTG ACGCGGGTTT TAAATTGCTC CACGGTAATC TGCAATATGA TCGCCGTGAC 8340
GTTGTTCACA AGACTTACCG GTGTCCGCAA GATGTTATCG CTGCTGTTAA TCTGCTGAAG 8400
CGTAAATGCG GTAATAGGGA CACGAAGTAT CAATCCTGGA CATCTGAGTC CAAAGTTTCT 8460
AGAAGTCTCA CGAAGCGTCG TATTACTTCT GGTTTGCAGG TCACTATTGA TCCGAACAGA 8520
ACGTATCTTA CGATGACTCA AGCTGATAAA GCGGCCCTTC AAACGAGGGC TAAGGATTTT 8580
CCCGTGAGCA AGGACTGGAT TGATGGACAC ATAAAAACAG TACACGAAGC GCAAGGGATC 8640
TCTGTTGACA ACGTCACTTT GGTTCGGCTT AAGTCGACCA AATGTGATTT GTTTAAACAT 8700
GAGGAGTACT GTTTGGTTGC CTTAACACGA CACAAGAAGT CCTTTGAGTA TTGCTTTAAC 8760
GGCGAGCTCG CTGGTGATTT GATCTTTAAT TGTGTTAAGT GATGCGCTTG TCTCTGTGTG 8820
AGACCTCTGC TCGAGAATTC GAGCTCGGTA CCCGGGGATC CTCTAGAGTC CGCAAATCAC 8880
CAGTCTCTCT CTACAAATCT ATCTCTCTCT ATTTTCTCCA GAATAATGTG TGAGTAGTTC 8940
CCAGATAAGG GAATTAGGGT TCTTATAGGG TTTCGCTCAT GTGTTGAGCA TATAAGAAAC 9000
CCTTAGTATG TATTTGTATT TGTAAAATAC TTCTATCAAT AAAATTTCTA ATTCCTAAAA 9060
CCAAAATCCA GTGACCGGGT GGTCAGTCCC TTATGTTACG TCCTGTAGAA ACCCCAACCC 9120
GTGAAATCAA AAAACTCGAC GGCCTGTGGG CATTCAGTCT GGATCGCGAA AACTGTGGAA 9180
TTGATCAGCG TTGGTGGGAA AGCGCGTTAC AAGAAAGCCG GGCAATTGCT GTGCCAGGCA 9240
GTTTTAACGA TCAGTTCGCC GATGCAGATA TTCGTAATTA TGCGGGCAAC GTCTGGTATC 9300
AGCGCGAAGT CTTTATACCG AAAGGTTGGG CAGGCCAGCG TATCGTGCTG CGTTTCGATG 9360
CGGTCACTCA TTACGGCAAA GTGTGGGTCA ATAATCAGGA AGTGATGGAG CATCAGGGCG 9420
GCTATACGCC ATTTGAAGCC GATGTCACGC CGTATGTTAT TGCCGGGAAA AGTGTACAAT 9480
TCACTGGCCG TCGTTTTACA ACGTCGTGAC TGGGAAAACC CTGGCGTTAC CCAACTTAAT 9540
CGCCTTGCAG CACATCCCCC TTTCGCCAGC TGGCGTAATA GCGAAGAGGC CCGCACCGAT 9600
CGCCCTTCCC AACAGTTGCG CAGCCTGAAT GGCGAATGNN NNNNNAATTC AGTACATTAA 9660
AAACGTCCGC AATGTGTTAT TAAGTTGTCT AAGCGTCAAT TTGTTTACAC CACAATATAT 9720
CCTGCCACCA GCCAGCCAAC AGCTCCCCGA CCGGCAGCTC GGCACAAAAT CACCACTCGA 9780
TACAGGCAGC CCATCAGNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 9840
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 9900
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 9960
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 10020
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 10080
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 10140
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 10200
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 10260
NNNNNNNNNN NN 10272






<210> 8 
<211> 10166 
<212> DNA 

<213> Brome mosaic virus 
<400> 8 

AAACACTGAT AGTTTAAACT GAAGGCGGGA AACGACAATC TGATCATGAG CGGAGAATTA 60 
AGGGAGTCAC GTTATGACCC CCGCCGATGA CGCGGGACAA GCCGTTTTAC GTTTGGAACT 120 
GACAGAACCG CAACGATTGA AGGAGCCACT CAGCCGCGGG TTTCTGGAGT TTAATGAGCT 180 
AAGCACATAC GTCAGAAACC ATTATTGCGC GTTCAAAAGT CGCCTAAGGT CACTATCAGC 240 



WO 99/61597 PCT/US99/11250

TAGCAAATAT TTCTTGTCAA AAATGCTCCA CTGACGTTCC ATAAATTCCC CTCGGTATCC 300
AATTAGNNNN NNNNNNNNNN NNNNNNNNNN GATCGTTTCG CATGATTGAA CAAGATGGAT 360
TGCACGCAGG TTCTCCGGCC GCTTGGGTGG AGAGGCTATT CGGCTATGAC TGGGCACAAC 420
AGACAATCGG CTGCTCTGAT GCCGCCGTGT TCCGGCTGTC AGCGCAGGGG CGCCCGGTTC 480
TTTTTGTCAA GACCGACCTG TCCGGTGCCC TGAATGAACT GGAGGACGAG GCAGCGCGGC 540
TATCGTGGCT GGCCACGACG GGCGTTCCTT GCGCAGCTGT GCTCGACGTT GTCACTGAAG 600
CGGGAAGGGA CTGGCTGCTA TTGGGCGAAG TGCCGGGGCA GGATCTCCTG TCATCTCACC 660
TTGCTCCTGC CGAGAAAGTA TCCATCATGG CTGATGCAAT GCGGCGGCTG CATACGCTTG 720
ATCCGGCTAC CTGCCCATTC GACCACCAAG CGAAACATCG CATCGAGCGA GCACGTACTC 780
GGATGGAAGC CGGTCTTGTC GATCAGGATG ATCTGGACGA AGAGCATCAG GGGCTCGCGC 840
CAGCCGAACT GTTCGCCAGG CTCAAGGCGC GCATGCCCGA CGGCGATGAT CTCGTCGTGA 900
CCCATGGCGA TGCCTGCTTG CCGAATATCA TGGTGGAAAA TGGCCGCTTT TCTGGATTCA 960
TCGACTGTGG CCGGCTGGGT GTGGCGGACC GCTATCAGGA CATAGCGTTG GCTACCCGTG 1020
ATATTGCTGA AGAGCTTGGC GGCGAATGGG CTGACCGCTT CCTCGTGCTT TACGGTATCG 1080
CCGCTCCCGA TTCGCAGCGC ATCGCCTTCT ATCGCCTTCT TGACGAGTTC TTCTGANNNN 1140
NNNNNNNNNN NNNNNNNNNN GATCGTTCAA ACATTTGGCA ATAAAGTTTC TTAAGATTGA 1200
ATCCTGTTGC CGGTCTTGCG ATGATTATCA TATAATTTCT GTTGAATTAC GTTAAGCATG 1260
TAATAATTAA CATGTAATGC ATGACGTTAT TTATGAGATG GGTTTTTATG ATTAGAGTCC 1320
CGCAATTATA CATTTAATAC GCGATAGAAA ACAAAATATA GCGCGCAAAC TAGGATAAAT 1380
TATCGCGCGC GGTGTCATCT ATGTTACTAG ATCGGGCCTC CTGTCAATGC TGGCGGCGGC 1440
TCTGGTGGTG GTTCTGGTGG CGGCTCTGAG GGTGGTGGCT CTGAGGGTGG CGGTTCTGAG 1500
GGTGGCGGCT CTGAGGGAGG CGGTTCCGGT GGTGGCTCTG GTTCCGGTGA TTTTGATTAT 1560
GAAAAGATGG CAAACGCTAA TAAGGGGGCT ATGACCGAAA ATGCCGATGA AAACGCGCTA 1620
CAGTCTGACG CTAAAGGGAA ACTTGATTCT GTCGCTACTG ATTACGGTGC TGCTATCGAT 1680
GGTTTCATTG GTGACGTTTC CGGCCTTGCT AATGGTAATG GTGCTACTGG TGATTTTGCT 1740
GGCTCTAATT CCCAAATGGC TCAAGTCGGT GACGGTGATA ATTCACCTTT AATGAATAAT 1800
TTCCGTCAAT ATTTACCTTC CCTCCCTCAA TCGGTTGAAT GTCGCCCTTT TGTCTTTGGC 1860
CCAATACGCA AACCGCCTCT CCCCGCGCGT TGGCCGATTC ATTAATGCAG CTGGCACGAC 1920
AGGTTTCCCG ACTGGAAAGC GGGCAGTGAG CGCAACGCAA TTAATGTGAG TTAGCTCACT 1980
CATTAGGCAC CCCAGGCTTT ACACTTTATG CTTCCGGCTC GTATGTTGTG TGGAATTGTG 2040
AGCGGATAAC AATTTCACAC AGGAAACAGC TATGACCATG ATTACGCCAA GCTTGCTGCC 2100
TGCAGGTCAA CATGGTGGAG CACGACACTC TCGTCTACTC CAAGAATATC AAAGATACAG 2160
TCTCAGAAGA CCAGAGGGCT ATTGAGACTT TTCAACAAAG GGTAATATCG GGAAACCTCC 2220
TCGGATTCCA TTGCCCAGCT ATCTGTCACT TCATCGAAAG GACAGTAGAA AAGGAAGATG 2280
GCTTCTACAA ATGCCATCAT TGCGATAAAG GAAAGGCTAT CGTTCAAGAA TGCCTCTACC 2340
GACAGTGGTC CCAAAGATGG ACCCCCACCC ACGAGGAACA TCGTGGAAAA AGAAGACGTT 2400
CCAACCACGT CTTCAAAGCA AGTGGATTGA TGTGATATCT CCACTGACGT AAGGGATGAC 2460
GCACAATCCC ACTATCCTTC GCAAGACCCT TCCTCTATAT AAGGAAGTTC ATTTCATTTG 2520
GAGAGGACCT CGAGAATTCG AGCTCGGTAC CCGCAACACA CATCTGACCT TGTTGTTGTT 2580
GTGTGCTTGT TCTTTCTACT ATCACCAAGA TGTCTTCGAA AACCTGGGAT GATGATTTCG 2640
TTCGCCAGGT CCCGTCTTTC CAATGGATCA TAGATCAATC CTTAGAAGAC GAGGTGGAGG 2700
CTGCTAGCCT TCAGGTGCAG GAGCCGGCAG ACGGAGTTGC CATTGACGGA TCTCTCGCGA 2760
GTTTTAAATT AGCTATAGCG CCCTTGGAGA TAGGAGGGGT ATTCGATCCC CCTTTTGACC 2820
GAGTGCGCTG GGGCTCTATT TGCGACACCG TCCAACAAAT GGTTCAACAG TTCACCGATA 2880
GACCGCTGAT TCCTCAAGCT GAAATGGCAC GGATGTTATA TCTTGACATT CCGGGCTCTT 2940
TCGTGCTCGA AGATGAAATT GATGACTGGT ATCCCGAGGA TACTAGTGAT GGTTACGGTG 3000
TATCGTTTGC CGCCGATGAA GATCATGCGA GCGATCTAAA ACTCGCCAGT GATTCCTCGA 3060
ACTGTGAAAT TGAGGAAGTT CGTGTTACTG GAGATACCCC CAAGGAGCTG ACCCTTGGAG 3120
ATAGGTACAT GGGCATTGAT GAAGAGTTTC AGACTACTAA TACTGATTAC GACATCACTC 3180
TTCAAATCAT GAACCCTATT GAACATAGGG TTTCGCGTGT TATTGATACA CACTGCCATC 3240
CAGATAACCC TGACATCTCT ACTGGGCCAA TTTATATGGA GAGAGTCAGC CTTGCTAGAA 3300
CAGAAGCGAC CAGTCATTCC ATACTGCCAA CCCATGCTTA TTTCGATGAT TCGTACCATC 3360
AAGCCCTTGT TGAAAATGGT GATTATTCCA TGGACTTTGA TAGGATCAGA CTTAAGCAAA 3420
GTGATGTAGA CTGGTATAGG GACCCCGATA AATATTTTCA ACCAAAAATG AATATCGGGA 3480
GTGCTCAGCG AAGAGTTGGT ACTCAGAAAG AAGTCTTAAC CGCACTCAAA AAGCGAAACG 3540
CGGACGTTCC AGAAATGGGA GACGCGATTA ACATGAAGGA CACTGCGAAA GCTATAGCAA 3600
AGCGCTTTCG TAGCACATTC CTTAATGTTG ACGGTGAAGA CTGTCTGAGA GCTTCTATGG 3660
ATGTCATGAC TAAATGTCTT GAGTACCATA AGAAGTGGGG TAAGCACATG GACTTGCAAG 3720
GTGTGAATGT GGCAGCAGAG ACTGATTTAT GTCGGTACCA GCATATGCTG AAGTCTGACG 3780
TAAAACCTGT TGTAACTGAC ACCCTTCACT TGGAACGAGC AGTAGCAGCT ACTATAACAT 3840
TTCATAGTAA AGGTGTGACT AGTAATTTTT CACCCTTTTT CACTGCTTGT TTCGAGAAGT 3900
TATCACTGGC CCTGAAATCC AGGTTCATTG TGCCTATCGG AAAGATATCC TCTCTGGAGC 3960
TTAAGAATGT CCGCTTGAAT AACAGATACT TTCTTGAAGC GGACCTAAGC AAATTTGATA 4020
AATCTCAGGG TGAGCTGCAC CTAGAGTTTC AGAGAGAGAT ACTCCTTGCG CTGGGCTTTC 4080
CAGCGCCGCT GACGAATTGG TGGTCTGATT TTCATCGCGA TTCTTATTTA TCAGACCCTC 4140
ATGCCAAGGT GGGAATGTCC GTTTCCTTCC AACGCAGAAC TGGTGACGCG TTTACATATT 4200
TCGGTAATAC TCTTGTCACT ATGGCTATGA TTGCATATGC CTCTGATCTA AGTGACTGTG 4260
ACTGTGCAAT ATTTTCAGGA GATGATTCTT TAATCATCTC TAAAGTTAAG CCAGTCCTGG 4320
ATACCGATAT GTTTACGTCT CTCTTCAATA TGGAGATAAA AGTCATGGAC CCTAGTGTGC 4380
CCTACGTTTG TAGTAAGTTT CTCGTCGAAA CTGAAATGGG CAATTTGGTG TCTGTACCAG 4440
ATCCTCTGAG AGAGATCCAG CGCTTAGCTA AGCGAAAGAT TCTGCGTGAT GAACAGATGC 4500
TCAGAGCACA TTTCGTTTCC TTCTGTGATC GAATGAAGTT TATTAATCAA CTTGATGAGA 4560
AGATGATTAC GACGCTCTGT CATTTTGTTT ATCTGAAATA TGGGAAAGAA AAACCTTGGA 4620
TTTTCGAGGA GGTTAGAGCT GCTCTTGCGG CTTTTTCTTT ATACTCCGAG AATTTCCTGA 4680
GGTTCTCTGA TTGCTACTGT ACCGAAGGCA TCAGAGTTTA TCAGATGAGC GATCCTGTAT 4740
GTAAGTTCAA ACGCACCACG GAAGAGCGTA AAACTGATGG TGACTGGTTT CACAACTGGA 4800
AGAATCCAAA GTTTCCTGGT GTGTTAGACA AAGTCTACAG AACCATTGGA ATTTATTCCT 4860
CGGACTGTAG TACTAAGGAG CTCCCTGTCA AACGGATCGG ACGTTTACAT GAGGCCCTTG 4920
AGCGTGAGTC ACTCAAATTA GCTAATGATC GTAGGACCAC ACAACGCTTG AAAAAGAAGG 4980
TCGACGATTA CGCTACCGGT AGAGGAGGCC TAACGTCAGT TGATGCTTTG CTCdTGAAGT 5040
CCCATTGTGA GACTTTTAAG CCCTCTGATC TGAGATGATC GGTTCTATGA TATATGAACC 5100
TAAGCTGTGA ACAGCCCTTT GGTTAAGGTT AAAAACTCCT GGTCAGGCAG ACCACTTTGG 5160
CTAAGTTTAA AAGCTGGGGA TCCTCTAGAG TCCGCAAATC ACCAGTCTCT CTCTACAAAT 5220
CTATCTCTCT CTATTTTCTC CAGAATAATG TGTGAGTAGT TCCCAGATAA GGGAATTAGG 5280
GTTCTTATAG GGTTTCGCTC ATGTGTTGAG CATATAAGAA ACCCTTAGTA TGTATTTGTA 5340
TTTGTAAAAT ACTTCTATCA ATAAAATTTC TAATTCCTAA AACCAAAATC CAGTGACCTG 5400
CAGGCATGCA AGCTTGCATG CCTGCAGGTC GACTCTAGAG GATCCCCGGT CACTGGATTT 5460
TGGTTTTAGG AATTAGAAAT TTTATTGATA GAAGTATTTT ACAAATACAA ATACATACTA 5520
AGGGTTTCTT ATATGCTCAA CACATGAGCG AAACCCTATA AGAACCCTAA TTCCCTTATC 5580
TGGGAACTAC TCACACATTA TTCTGGAGAA AATAGAGAGA GATAGATTTG TAGAGAGAGA 5640
CTGGTGATTT GCGGACTCTA GAGGATCCCC GGGTACCGAG CTCGAATTCT CGAGCAGAGG 5700
TCTCACACAG AGACAAGCGC ATCACTTAAC ACAATTAAAG ATCAAATCAC CAGCGAGCTC 5760
GCCGTTAAAG CAATACTCAA AGGACTTCTT GTGTCGTGTT AAGGCAACCA AACAGTACTC 5820
CTCATGTTTA AACAAATCAC ATTTGGTCGA CTTAAGCCGA ACCAAAGTGA CGTTGTCAAC 5880
AGAGATCCCT TGCGCTTCGT GTACTGTTTT TATGTGTCCA TCAATCCAGT CCTTGCTCAC 5940
GGGAAAATCC TTAGCCCTCG TTTGAAGGGC CGCTTTATCA GCTTGAGTCA TCGTAAGATA 6000
CGTTCTGTTC GGATCAATAG TGACCTGCAA ACCAGAAGTA ATACGACGCT TCGTGAGACT 6060
TCTAGAAACT TTGGACTCAG ATGTCCAGGA TTGATACTTC GTGTCCCTAT TACCGCATTT 6120
ACGCTTCAGC AGATTAACAG CAGCGATAAC ATCTTGCGGA CACCGGTAAG TCTTGTGAAC 6180
AACGTCACGG CGATCATATT GCAGATTACC GTGGAGCAAT TTAAAACCCG CGTCACGAGA 6240
CTTGAACGAA ATCTGCTCTG TGTCCCCAAA GGCAAGAACT TGTGAACATT TAGACAGAGC 6300
AGCCACCACC AGGAGTTGAC CATAATGTAG TAAACCAGCC TCATCAACAA GCAGCCTATG 6360
ACAGGACGGT ACACCGTGCA TGATCGCAGA ATCCGCGGTG CGCACAACGT CCAAAGCTAC 6420
CTTGGAATTA TAAGTGTCAG GGAATAAAGC CATCCTGACG TCCTCGGCCG ATTTACGATT 6480
CGCCGTCACA ATTAGGTCCT CTCCCATACG GAATGCATCT TTTATGGCAG TGGTTTTACC 6540
GCATCCCGCA ACTCCATCAA CCATGGAAAT ATCGCATGTA GGGACAGAAA CTTTGGCGCT 6600
AGCTTCTGCA ATGTCCCTCA AGTTAGAGCA TGCACATGTT TTATCAACAA TGTACGTTTC 6660
ATCTGCGTGC TTCGGACCTA AACCATGCTC ATTATATCCA ACAGTGTAAT CGTATTTTTT 6720
AGGATACAAC CAGTTACCGT TGGCCAAATG GACATTCACC ATATCGTCTA TGCGATGGTA 6780
GGTCTCAAAG ATGCTCTTAT TTGCGATCTC ACTTCCGCGA CCGCCGGAAA TGTCCCATAG 6840
GTGACGAAGA TTAGACTCGG AGTTGTTATG TAATCTCTTA CAATAACGCA CAAATTCCTT 6900
CATGGCTCCG TGTCTAGATA TGCCACGAGG GTCCGTTGGT ACCTCAACAG ACACCTCGGC 6960
ATCCGGGACC ACATCAGTCA CCGGTTTAAC GTCATCACTG ACGGACTCAG GGCTCGAACT 7020
CTCAGGGGCA TCATGAAACT CCTCCTGAGG TATCTCAGCA GCTGGCGGGA CTTTCGCCTT 7080
CTTCTTCGAG CGCTTGGTCT TGGCTGTCTG CACTTCATGC TCCAGCCGGT CGAATAAGTC 7140
CTCTTCAGTC CAAAACGTTC TCAAACGTGA TATCGGTACA GAATCTTGCT CAAATTCTTC 7200
AACGTTTGAG AGACGAGTCA GAAACTTAAA ACTGTCCGCA TAAGAATCCA GACGTAGTAG 7260
GGGAAATCTG CTAGCCAATG TTCTCAGCCA TCCTACTTTC GCCCTGGATG AATCTCCACC 7320
CCACCAAAAC CTAGTTTTGA AGTGATGGCA CCAACCTTTC CATTCCATCC CATCGCGGAG 7380
GGCCGTAAGC TTTTCGTACT TTTGATACAG ATTCAAAGTC AAAGCAAAGG CCACTAGATG 7440
ATAATCTTCA ATGTCTAAGC GCTCACCAGC CATGATAGCC TGACCGTTAA TAATAACAGT 7500
CGACGACTTG GCGGATAAGA TAGATGCGAC AGCTTTCATG TTCTCAGTCC ATTCTTTACT 7560
TTCCTTGAAA CATCTGAAAG CTATCTCCTC TACCTCTCTC ACTGTGGTTT TGGCGACGCG 7620
CACACATTTC CAGCGATTGA GACTCCAGTC TTCAGGTATT GAGACCCCTA CGTACTTAGA 7680
TATGTCTTCA AACCATACAC AGTGACGTAG TGTCTCCCGG GGGCAGCGTA AATTTGTAGC 7740
GATGATCTTA TAGGTCATGA TGTTACATTT CAGCATTTCG CGCTCCAACA GATAGGTGGT 7800
TCCATCGATG CAATGCACCG ACTCGGTGAA AAATGAGCCC AAATCTTGCC ATCCGTGGAT 7860
GTAAGATAAT GTGCTTTCAT TTTCAAAATC GAATTTGATC ACCTCATCCG CGCCTGACCC 7920
GTCACGTTGC CAGTGACATT TAAGCAAGGG AAGAAAACCC TCGCGGTCAA ACAACATGGC 7980
GCCGTCGAAC ATAACGGTAC CACGTAGTAC GCGTACTCCA TGCGAATGCA TGGCGTCACA 8040
CAGACCTTGG AAGCCCATAT CATAACCGCC GTGGATACAG ATAGCCCAAT CAGCTTGGAC 8100
ATCACAATCT TGAGCTCGGT TAAGACAAAA GTTCGGGACT TCATCGAAAT CATCGCTTTC 8160
TTGCAAAATT TTTCGCATGC GGCACATCCT CTCCTCATGT CGGGCAGCGT CTCTAACACC 8220
CAACACAGGA CAACAACTGT GCACCCTTTT ATCCCTTCTT GAAAAGTGAT GCCACCAAGA 8280
CCCTCCGAAA TCTATAACGG GGTCTTCAGG GGGAAAACTG TCGAGACAGT CATAATGCTC 8340
CGCTACACGC AGAGCACCAG CCAGGCTATG GGGCGCATGA TACTGCTGAG TCAAATTTAA 8400
GTCAAAGGCA CCACCATAAC GGTCACGGAA GGCGTCAGCC TCCTCAATAG AGAGCTTATT 8460
GCGAACGTTG ATTTTCTTAG ACCTTTTCGC GTATTCAATC TGCGCAGATA ACTGTTGCGC 8520
AACCTGATTG TCTACGATGT CTTGGGCACT CTGGCTGTCA GCACCCTTCT CAGCAATCAA 8580
CTTCAGCAAA TCGATAGAAC TTGACATTTT GTTGGTGAAA AACAAAGAAC AAGTAGCAGA 8640
ACCGTGGTCG AGGTCCTCTC CAAATGAAAT GAACTTCCTT ATATAGAGGA AGGGTCTTGC 8700
GAAGGATAGT GGGATTGTGC GTCATCCCTT ACGTCAGTGG AGATATCACA TCAATCCACT 8760
TGCTTTGAAG ACGTGGTTGG AACGTCTTCT TTTTCCACGA TGTTCCTCGT GGGTGGGGGT 8820
CCATCTTTGG GACCACTGTC GGTAGAGGCA TTCTTGAACG ATAGCCTTTC CTTTATCGCA 8880
ATGATGGCAT TTGTAGAAGC CATCTTCCTT TTCTACTGTC CTTTCGATGA AGTGACAGAT 8940
AGCTGGGCAA TGGAATCCGA GGAGGTTTCC CGATATTACC CTTTGTTGAA AAGTCTCAAT 9000
AGCCCTCTGG TCTTCTGAGA CTGTATCTTT GATATTCTTG GAGTAGACGA GAGTGTCGTG 9060
CTCCACCATG TTGACCGGGT GGTCAGTCCC TTATGTTACG TCCTGTAGAA ACCCCAACCC 9120
GTGAAATCAA AAAACTCGAC GGCCTGTGGG CATTCAGTCT GGATCGCGAA AACTGTGGAA 9180
TTGATCAGCG TTGGTGGGAA AGCGCGTTAC AAGAAAGCCG GGCAATTGCT GTGCCAGGCA 9240
GTTTTAACGA TCAGTTCGCC GATGCAGATA TTCGTAATTA TGCGGGCAAC GTCTGGTATC 9300
AGCGCGAAGT CTTTATACCG AAAGGTTGGG CAGGCCAGCG TATCGTGCTG CGTTTCGATG 9360
CGGTCACTCA TTACGGCAAA GTGTGGGTCA ATAATCAGGA AGTGATGGAG CATCAGGGCG 9420
GCTATACGCC ATTTGAAGCC GATGTCACGC CGTATGTTAT TGCCGGGAAA AGTGTACAAT 9480
TCACTGGCCG TCGTTTTACA ACGTCGTGAC TGGGAAAACC CTGGCGTTAC CCAACTTAAT 9540
CGCCTTGCAG CACATCCCCC TTTCGCCAGC TGGCGTAATA GCGAAGAGGC CCGCACCGAT 9600
CGCCCTTCCC AACAGTTGCG CAGCCTGAAT GGCGAATGNN NNNNNAATTC AGTACATTAA 9660
AAACGTCCGC AATGTGTTAT TAAGTTGTCT AAGCGTCAAT TTGTTTACAC CACAATATAT 9720
CCTGCCACCA GCCAGCCAAC AGCTCCCCGA CCGGCAGCTC GGCACAAAAT CACCACTCGA 9780
TACAGGCAGC CCATCAGNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 9840
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 9900
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 9960
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 10020
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 10080
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 10140
NNNNNNNNNN NNNNNNNNNN NNNNNN 10166
DESCRIPTION

IMPROVED METHODS AND MATERIALS FOR TRANSFORMATION

This invention was made with United States government support awarded by the
following agency:

NIH Grant No: GM35072
The United States has certain rights in this invention.

Background of the Invention

RNA viruses have been found to be valuable tools in the phenotypic and genotypic
transformation of targeted cells and tissues. See, e.g., U.S. Patent No. 5,500,360, which teaches
novel viral RNA expression vectors. It has been shown that the RNA of the genome of an RNA
virus can be modified to include an exogenous RNA segment and that the modified RNA can
be introduced into a host cell, replicated therein, and thereby express the exogenous RNA
segment.

Current methods of inoculating a host cell with modified RNA viruses involve the in
vitro transcription of a particular strand followed by the introduction of the resulting RNA
transcripts into the host cell. One problem with the current inoculation method is that the RNA
rapidly degrades which causes a low efficiency of infection. In addition, the preparation of the
in vitro RNA transcripts is expensive and time consuming.

Further, with the advent of transformation and the genetic engineering of plants, much
concern has arisen concerning the potential hazard of the dispersal of dangerous traits into the
environment. For example, genes increasing the stress tolerance and/or herbicide resistance of
an agriculturally important crop could theoretically "leak" to surrounding less desirable and
damaging plants, e.g., through pollen, mechanical or insect dispersal. This phenomenon could
create a novel species of "super-weed" which could wreak havoc on the agricultural industry.
Existing RNA virus-based vectors can spread to non-target plants by mechanical means and/or
by insects. Such spread can be prevented by using vectors that can replicate and/or move only
in target plants expressing the appropriate trans-acting factors. Accordingly, there remains a
need for less expensive and more efficient methods of transformation of target cells and tissues.
Moreover, there is a need for a novel method of transformation which alleviates the potential
dangers associated with the unwanted spread of engineered traits into the environment.

Brief Summary of the Invention

The subject invention pertains to improved materials and methods for transforming host
cells which involve transfecting said cells with a DNA-launching platform. One aspect of the
subject invention pertains to a DNA-launching platform which encodes a modified viral RNA
molecule downstream of a DNA-dependent RNA polymerase (pol) promoter, whereby the
DNA-launching platform is capable of being introduced into a host cell and effectively "launching"
said modified viral RNA molecule into the host cell such that it is replicated and expressed
therein. The term "modified viral RNA molecule" as used herein refers to a viral RNA which
has been changed from its natural state. Examples of changes of viral RNA include, but are not
limited to, removal of a part of the viral RNA genome, insertion or substitution of an exogenous
RNA, etc. The exogenous RNA segment can be located in a region of the viral RNA molecule
such that it does not disrupt the RNA replication. Techniques for such manipulations have been
well known to those of ordinary skill in the art for many years. Preferably, the modified viral
RNA molecule further comprises a ribozyme which is located in the proximity of the 3' end of
the modified viral RNA molecule. The viral segment may have the ability to be replicated with
or, alternatively, without the presence of trans-acting viral replicating elements.

Another aspect of the subject invention pertains to a method of genotypically or
phenotypically modifying a host cell, comprising introducing a DNA-launching platform which
encodes a viral RNA molecule and an exogenous RNA segment in a location which does not
disrupt the replication of said viral RNA segment or said exogenous RNA segment, whereby the
exogenous RNA segment confers a detectable trait in the host cell. The subject invention applies
to a wide array of plant cells.

Still a further aspect of the subject invention pertains to cells in which the
DNA-launching platform of the subject invention has been introduced.

Yet another aspect of the subject invention pertains to a plant comprising cells
transfected with the DNA-launching platform.

The novel methods and materials of the subject invention provide a greater inoculation
efficiency of RNA viruses because the DNA-launching platforms of the subject invention are
more resistant to degradation than RNA inocula, and because each DNA platform produces
multiple RNA transcripts over an extended period of time. As the DNA-launching platform
provides a genetically stable in planta archive copy of a desired vector construct, the continuing
transcription of said DNA platform will repeatedly reinoculate the host cell with the desired
construct. This serves to counteract genetic instability problems that have inhibited the
expression of some genes from vectors based on plant and animal RNA viruses. Further, the
inoculation methods of the subject invention provide a much simpler means of producing inocula
in bulk for large scale use, which is cheaper and more efficient than inoculating with in vitro
RNA transcripts.

Brief Description of the Drawings

Figure 1 represents the schematic for producing the 1a and 2a proteins in the host cell.

Figure 2 illustrates an example of an Agrobacterium transformation vector containing
an expression cassette capable of expressing 1a and/or 2a BMV proteins.

Figure 3 illustrates several Agrobacterium vectors that were produced to transform host
plant cells (black rectangles indicate T-DNA borders).

Figure 4 represents the general mechanism of BMV RNA3 launching and replication.

Figure 5 depicts DNA-launching platforms which can be used in accord with the
teachings contained herein. The BMV and CCMV designations denote cis-acting elements.

Figure 6 depicts DNA-launching platforms which can be used in accord with the
teachings contained herein.

Figure 7 depicts DNA-launching platforms which can be used in accord with the
teachings contained herein.

Figure 8 depicts DNA-launching platforms which can be used in accord with the
teachings contained herein.

Figure 9 depicts an Agrobacterium vector for delivery of DNA-launching platforms to
plant cells (open triangles represent T-DNA borders).

Figure 10 depicts DNA-launching platforms which can be used in accord with the
teachings contained herein.



Legend For Figures 5-10:

35S = CaMV35S promoter
t = termination/polyA+ sequences
Rz = ribozyme
NOS = NOS promoter
OOA = origin of assembly
FG = foreign gene



Figure 11 shows that BMV replication factors support efficient RNA3 replication in
protoplasts.

Figure 12 shows the efficient replication of launched BMV RNA3 in protoplasts.

Figure 13 shows transgenic expression of BMV 1a and 2a mRNAs in N. tabacum and
N. benthamiana.

Figure 14 shows the efficient replication of launched BMV RNA3 in (1a + 2a)-transgenic
plants.

Figure 15 shows the successful GUS expression from the launched BMV RNA3 in
(1a + 2a)-transgenic plants.

Figure 16 shows the successful GUS expression from the launched BMV RNA3 in
protoplasts.

Figure 17 shows the successful GFP expression from the launched BMV RNA3 in
(1a + 2a)-transgenic plants.

Figure 18 shows the successful GFP expression from the launched BMV RNA3 in
protoplasts.

Figure 19 shows the efficient replication of the launched BMV RNA3 in (1a + 2a)-transgenic
N. benthamiana using Agrobacterium inoculation.

Figure 20 shows the successful GUS expression from the launched BMV RNA3 having
the SHMV coat protein in (1a + 2a)-transgenic plants.

Figure 21 shows that launched BMV replicates, moves cell-to-cell, and spreads long
distances in (1a + 2a)-transgenic plants.

Figure 22 shows transfection of progeny from (1a + 2a)-transgenic N. benthamiana with
BMV RNA3 DNA-launching platform and localization of the launched RNA3 to the roots.

Brief Description of the Sequences

SEQ ID NO. 1: pB1LR2 - partial nucleotide sequence includes BMV 1a expression
cassette.

SEQ ID NO. 2: pB1LR3 - partial nucleotide sequence includes BMV 1a expression
cassette.

SEQ ID NO. 3: pB2LR4 - partial nucleotide sequence includes BMV 2a expression
cassette.

SEQ ID NO. 4: pB2LR5 - partial nucleotide sequence includes BMV 2a expression
cassette.

SEQ ID NO. 5: pB12LR6 - partial nucleotide sequence includes BMV 1a and 2a
expression cassettes.

SEQ ID NO. 6: pB12LR7 - partial nucleotide sequence includes BMV 1a and 2a
expression cassettes.

SEQ ID NO. 7: pB12LR8 - partial nucleotide sequence includes BMV 1a and 2a
expression cassettes.

SEQ ID NO. 8: pB12LR9 - partial nucleotide sequence includes BMV 1a and 2a
expression cassettes.

Detailed Disclosure of the Invention

To facilitate understanding of the invention, certain terms used throughout are herein
defined. The term "RNA virus" as used herein means a virus whose genome is RNA in a
double-stranded or single-stranded form, the single strand being a (+) strand or (-) strand.

The terms "transfection" or "transfected" as used herein mean an introduction of a
foreign DNA or RNA into a cell by mechanical inoculation, electroporation, agroinfection,
particle bombardment, microinjection, or by other known methods.

The terms "transformation" or "transformed" as used herein mean a stable
incorporation of a foreign DNA or RNA into the cell which results in a permanent, heritable
alteration in the cell. Accordingly, the skilled artisan would understand that transfection of a cell
may result in the transformation of that cell.

The term "launched" as used herein refers to a polynucleotide that has been transcribed
from a DNA-launching platform, as described herein and, preferably, replicated.

The term "cis-acting element" as used herein denotes that portion of the RNA genome
of an RNA virus which must be present in cis, that is, present as a part of each viral strand as a
necessary condition for replication of that strand. Virus replication may depend upon the
existence of one or more trans (diffusible) elements which interact with the cis-acting element
to carry out RNA replication. If trans-acting elements are necessary for replication, they need
not be present or coded for on the modified viral RNA provided, but may be made available
within the infected cell by some other means. For example, the trans-acting replication functions
may be provided by other, unmodified or modified, components of the viral genome transfected
into the cells simultaneously with the modified RNA. The same approach can be used for other
trans-acting functions including movement protein, coat protein, and other functions. The target
cell may also be premodified, for example, cells may have been previously transformed to
provide constitutive expression of the trans-acting functions from a chromosome. The cis-acting
element is composed of one or more segments of viral RNA which must be present on any RNA
molecule that is to be replicated within a host cell by RNA replication. The segment will most
likely be the 5' and 3' terminal portions of the viral RNA molecule, and may include other
portions and/or virus open reading frames as well. The cis-acting element is accordingly defined
in functional terms: any modification which destroys the ability of the RNA to replicate in a cell
known to contain the requisite trans-acting elements is deemed to be a modification in the
cis-acting element. Conversely, any modification, such as deletion or insertion in a sequence region
which is able to tolerate such deletion or insertion without disrupting replication, is a
modification outside the cis-acting element. As is demonstrated herein using the example of
BMV, which is known and accepted by those skilled in the art to be a functional example,
substantial portions of an RNA virus molecule may be modified, by deletion, insertion,
or by a combination of deletion and insertion, without disrupting replication.

"Exogenous RNA" is a term used to describe a segment or component of RNA to be
inserted into the virus RNA to be modified, the source of the exogenous RNA segment being
different from the RNA virus itself. The source may be another virus or an organism such as a
plant, animal, bacterium, or fungus. The exogenous RNA may be a chemically synthesized
RNA, derived from a native RNA, or it may be a combination of the foregoing. The exogenous
RNA may provide any function which is appropriate and known to be provided by an RNA
segment. Such functions include, but are not limited to, a coding function in which the RNA
acts as a messenger RNA encoding a sequence which, when translated by the host cell, results
in synthesis of a peptide or protein having useful or desired properties; the RNA segment may
also be structural, as for example in ribosomal RNA; it may be regulatory, as for example with
small nuclear RNAs or anti-sense RNA; or it may be catalytic. One skilled in the art will
understand that the exogenous RNA may encode, for example, a protein which is a key enzyme
in a biochemical pathway, which upon expression effects a desirable phenotypic characteristic,
such as altering cell metabolism. Further, the exogenous RNA may encode a protein involved
in transcriptional regulation, such as zinc finger, winged-helix, and leucine-zipper proteins. A
particularly interesting function is provided by anti-sense RNA, sometimes termed (-) strand
RNA, which is in fact a sequence complementary to another RNA sequence present in the target
cell which can, through complementary base pairing, bind to and inhibit the function of the RNA
in the target cell.
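
By way of illustration only, the complementary base pairing that underlies the anti-sense function can be sketched in a few lines of Python. The sequence shown is invented for the example and is not taken from any sequence of this disclosure; the function names `antisense` and `pairs_with` are likewise hypothetical.

```python
# Illustrative sketch only: the sequence below is made up, not from the disclosure.
# Watson-Crick complement table for RNA (A<->U, C<->G).
COMPLEMENT = str.maketrans("ACGU", "UGCA")


def antisense(rna: str) -> str:
    """Return the reverse complement of an RNA sequence, written 5'->3'."""
    return rna.upper().translate(COMPLEMENT)[::-1]


def pairs_with(target: str, candidate: str) -> bool:
    """True if candidate is fully complementary to target (Watson-Crick pairs only)."""
    return antisense(target) == candidate.upper()


target = "AUGGCUUCA"        # hypothetical fragment of an RNA present in the target cell
anti = antisense(target)    # anti-sense strand able to base-pair with the target
print(anti, pairs_with(target, anti))
```

The sketch considers only perfect Watson-Crick complementarity; pairing in a cell also tolerates G-U wobble pairs and mismatches, which are not modeled here.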

The term "non-viral" is used herein in a special sense to include any RNA segment
which is not normally contained within the virus whose modification is exploited for replication
and expression, and is therefore used synonymously with "exogenous". Accordingly, a gene
derived from a different virus species than that which is modified is included within the meaning
of the terms "non-viral" and "exogenous" for the purposes of describing the invention. For
example, a non-viral gene as the term is used herein could include a gene derived from a
bacterial virus, an animal virus, or a plant virus of a type distinguishable from the virus modified
to effect transformation. In addition, a non-viral gene may be a structural gene derived from any
prokaryotic or eukaryotic organism.

In one embodiment, the subject invention concerns a novel method of transfecting a host
cell which uses a DNA-launching platform to introduce viral RNA into the cell. The subject
invention is directed towards a method of transfection employing a DNA-launching platform
which encodes a modified viral RNA molecule comprising an RNA viral component attached
to an exogenous RNA component and a DNA-dependent RNA pol promoter. The
DNA-dependent RNA pol promoter is preferably but not necessarily fused within up to 10 nucleotides
of the 5' transcriptional start site of the modified viral RNA molecule, and more preferably
within up to 5 nucleotides of the 5' transcriptional start site. Expression of the DNA-launching
platform produces transcripts of the modified viral RNA molecule that are then capable of RNA
replication in the presence of replication factors, which can be present in the modified viral RNA
and/or may be supplied in trans by other means, including expression from a chromosome or
supply on different launching plasmids. When the modified viral RNA is replicated, the
exogenous RNA can be replicated as well. Further, the exogenous RNA can be expressed in the
cell, thereby providing a predetermined phenotypic characteristic. In a preferred embodiment,
the DNA-launching platform further comprises a nucleotide sequence encoding a self-cleavable
ribozyme situated proximate to the 3' end of said RNA molecule. As would be readily apparent
to those skilled in the art, known ribozymes may be used in accordance with the subject
invention. In a preferred embodiment, the ribozyme cleaves the modified RNA viral molecule
at the 3' region. The 3' region can consist of up to 30 nucleotides upstream or downstream of
the 3' end; and preferably consists of up to 10 nucleotides upstream or downstream of the 3' end.
In a more preferred embodiment, the ribozyme cleaves the modified RNA viral molecule
precisely at the 3' end. Other known regulatory sequences, e.g., promoters and/or termination
sequences, may also be substituted for and/or included on the DNA-launching platform. A
suitable restriction site can be introduced proximate to the 3' end of the modified viral RNA
molecule sequence and the DNA molecule can be cleaved by an appropriate restriction enzyme
prior to transfection. The term "DNA-launching platform" as used herein is intended to mean
a DNA molecule, circular or linear, which has a coding region comprising a segment encoding
a modified viral RNA segment, and further, which is capable of being delivered into a cell and
subsequently transcribed.
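
The nested distance preferences stated above (promoter within 10 or 5 nucleotides of the 5' transcriptional start site; ribozyme cleavage within 30 or 10 nucleotides of, or precisely at, the 3' end) can be summarized as a minimal sketch. The `PlatformLayout` record, its field names, and the tier labels are hypothetical bookkeeping introduced only for this example and are not part of the disclosure.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class PlatformLayout:
    """Hypothetical summary of a construct map; field names are assumptions."""
    promoter_gap_nt: int          # nucleotides between promoter and viral 5' start (>= 0)
    ribozyme_cut_offset_nt: int   # signed distance of the ribozyme cut from the viral 3' end


def promoter_tier(layout: PlatformLayout) -> str:
    """Classify the promoter fusion distance against the stated preferences."""
    gap = layout.promoter_gap_nt
    if gap < 0:
        raise ValueError("promoter_gap_nt must be non-negative")
    if gap <= 5:
        return "more preferred"
    if gap <= 10:
        return "preferred"
    # The text says "preferably but not necessarily", so this is still permitted.
    return "outside preferred range"


def ribozyme_tier(layout: PlatformLayout) -> str:
    """Classify the ribozyme cleavage position relative to the 3' end."""
    distance = abs(layout.ribozyme_cut_offset_nt)  # upstream or downstream both count
    if distance == 0:
        return "more preferred"
    if distance <= 10:
        return "preferred"
    if distance <= 30:
        return "within 3' region"
    return "outside 3' region"


example = PlatformLayout(promoter_gap_nt=3, ribozyme_cut_offset_nt=-7)
print(promoter_tier(example), "/", ribozyme_tier(example))
```

The sketch treats upstream and downstream offsets symmetrically, following the "upstream or downstream" wording of the text.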

Possible regulatory sequences can include, but are not limited to, any promoter already
shown to be constitutive for expression, such as those of viral origin (CaMV 19S and 35S) or
so-called "housekeeping" genes (ubiquitin, actin, tubulin) with their corresponding
termination/polyA+ sequences. Also, seed- and/or developmentally-specific promoters, such
as those from plant fatty acid/lipid biosynthesis genes (ACPs, acyltransferases, desaturases, lipid
transfer protein genes) or from storage protein genes (zein, napin, cruciferin, conglycinin,
phaseolin, or lectin genes, for example), with their corresponding termination/polyA+ sequences
can be used for targeted expression. In addition, the gene can be placed under the regulation of
inducible promoters and their termination sequences so that gene expression is induced by light
(rbcS-3A, cab-1), heat (hsp gene promoters) or wounding (mannopine, HGPGs). It is clear to
one skilled in the art that a promoter may be used either in native or truncated form, and may be
paired with its own or a heterologous termination/polyA+ sequence.

In a particularly preferred embodiment, the subject invention is directed toward a
method of genotypically or phenotypically modifying a cell comprising the following steps: a)
forming a cDNA molecule of a virus RNA, or of at least one RNA component if the RNA virus
is multipartite, the viral RNA having been modified to contain a DNA segment encoding a
non-viral RNA component situated in a region able to tolerate such insertion without disrupting
replication of the RNA product encoded thereby; b) cloning modified cDNA into a
DNA-launching platform; and c) transfecting a suitable host cell with said DNA-launching platform.
In a most preferred embodiment, the method further comprises pretransforming a plant with
trans-acting viral replication factors and/or other trans-acting factors. Such trans-acting factors
may include viral movement protein(s), coat protein(s), viral protease(s), and other structural
and nonstructural genes. In addition to stable expression of trans-acting factors, trans-acting
factors may be introduced on separate expression plasmids or may be expressed from RNA
transcripts. In a preferred embodiment such trans-acting factors do not replicate. Suitable host
cells may include protoplasts, cells in suspension, or cells in tissues or whole organisms.

In a specific embodiment intended as an example of the broader teachings herein, the
RNA viral segment can be derived from brome mosaic virus (BMV), whereby the
DNA-launching platform comprises DNA encoding the RNA3 segment of the virus. Brome mosaic
virus (BMV) is a member of the alphavirus-like superfamily of positive-strand RNA viruses of
animals and plants, and has a genome divided among three RNAs. RNA1 and RNA2 encode the
1a and 2a proteins, respectively, which are necessary for genomic RNA replication and
subgenomic mRNA synthesis (see, e.g., U.S. Patent No. 5,500,360, which, to the extent not
inconsistent herewith, is incorporated herein by reference). These proteins contain three
domains conserved in all other members of the alphavirus-like superfamily. 1a (109 kDa) contains
a C-proximal helicase-like domain and an N-proximal domain implicated in RNA capping, and
2a (94 kDa) contains a central polymerase-like domain. See, e.g., French and Ahlquist (1988).
1a and 2a interact with each other and with cell factors to form a membrane-bound viral RNA
replication complex associated with the endoplasmic reticulum of infected cells. BMV RNA3,
a 2.1-kb RNA, encodes the 3a protein (32 kDa) and coat protein (20 kDa), which are involved in
the spread of BMV infection in its natural plant hosts but are dispensable for RNA replication.
See U.S. Patent No. 5,500,360. The 3a or coat protein gene of the RNA3 viral segment can be
replaced with exogenous RNA, whereby it does not interfere with the replication element.
Further, the exogenous RNA segment can be inserted downstream of an additional subgenomic
promoter. Still further, cells or tissues can be pretransformed to express 1a, 2a, 3a, and coat
protein, or any combination thereof, wherein DNA-launching platforms containing a foreign
gene(s) with the necessary cis-acting components are transfected, such that the foreign gene is
replicated and/or expressed.

In one embodiment, the host cell is pretransformed with BMV1 or BMV2 such that it
is transgenically engineered to express 1a and 2a proteins. Preferably, the 5' and 3' ends of
BMV1 and BMV2 are removed such that they are incapable of replication, but can express 1a
and 2a to form a viral RNA replication complex associated with the endoplasmic reticulum of
the host cell. Subsequent transfection of a DNA-launching platform comprising the RNA3 viral
replication segment, as well as the exogenous RNA of interest, can result in expression of
said exogenous RNA while also preventing the undesired and dangerous spillage of viral RNA
into the environment. That is, because a plant must have all 3 segments to form
infectious BMV particle(s), problems associated with the environmentally hazardous escape of
foreign genes through mechanical or insect dispersal of RNA virus vectors are avoided. One
skilled in the art will readily appreciate that, in the example of BMV, DNA-launching
platforms could also be derived from either RNA1 or RNA2. For example, the sequence
encoding the 1a protein could be replaced with an exogenous RNA; replication would require
the expression of 1a (e.g., from a separate expression plasmid). In a preferred embodiment, the
DNA-launching platform also comprises a ribozyme situated proximate to the 3' end of the modified
RNA3, wherein said ribozyme cleaves the RNA3 at the 3' end. As would be readily apparent
to the skilled artisan with the teachings contained herein, viral segments from other known
viruses, and/or subviral agents, can be used to formulate DNA-launching platforms of the subject
invention. One skilled in the art will appreciate that BMV is merely one representative example
of the many viruses suitable for practicing the subject invention. It is widely accepted that
principles on which the subject invention is based are broadly applicable to a myriad of viruses.
Examples of other such viruses include, but are not limited to, alfalfa mosaic virus (AMV),
barley stripe mosaic virus, cowpea mosaic virus, cucumber mosaic virus, reoviruses, poliovirus,
sindbis virus, vesicular stomatitis virus, influenza virus, retroviruses, and cowpea chlorotic
mottle virus (CCMV) and any other viruses that replicate through RNA intermediates and from
which a cDNA copy can be obtained. Specifically, as the other viruses are further characterized,
those of skill in the art will readily appreciate the applicability of the teachings herein to other
suitable viruses as well.
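
The containment rationale given for BMV in this paragraph can be pictured with a small model. The following Python sketch is illustrative only: the function names and the set-based representation are assumptions introduced here, and the only rules encoded are the two stated in the text, namely that replication of the launched RNA3 depends on the 1a and 2a proteins and that infectious BMV requires all three genomic segments.

```python
# Illustrative model only; all names below are hypothetical.
GENOMIC_RNAS = frozenset({"RNA1", "RNA2", "RNA3"})
REPLICATION_FACTORS = frozenset({"1a", "2a"})


def can_replicate_rna3(proteins_in_cell):
    """Launched RNA3 replicates only where both 1a and 2a are expressed."""
    return REPLICATION_FACTORS <= set(proteins_in_cell)


def can_form_infectious_virus(replicable_rnas):
    """Infectious BMV needs all three replicable genomic segments."""
    return GENOMIC_RNAS <= set(replicable_rnas)


# Target plant: 1a and 2a come from transgenes whose 5' and 3' ends were
# removed, so RNA1 and RNA2 are not replicable; only launched RNA3 is.
target_proteins = {"1a", "2a"}
target_replicable_rnas = {"RNA3"}

# Non-target plant: no trans-acting factors are present.
non_target_proteins = set()

print(can_replicate_rna3(target_proteins))
print(can_replicate_rna3(non_target_proteins))
print(can_form_infectious_virus(target_replicable_rnas))
```

In this model the vector replicates in the pretransformed target plant, does not replicate in a plant lacking 1a and 2a, and cannot give rise to infectious virus in either, which is the behavior the paragraph describes.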

The skilled artisan would easily appreciate that known methods of introducing foreign
DNA into cells can be used in accordance with the teachings of the subject disclosure. Such
methods include, but are not limited to, mechanical inoculation, particle bombardment,
agroinfection, electroporation, and microinjection, as well as other known methods.

Various aspects of the invention can be modified as needed, depending upon specific
characteristics of the virus selected as the transforming and transfecting agent and of the RNA
segment to be inserted. For example, the inserted gene need not be a naturally occurring gene,
but may be modified, a composite of more than one coding segment, or it may encode more than
one protein. The RNA may also be modified by combining insertions and deletions in order to
control the total length or other properties of the modified RNA molecule. The inserted
non-viral gene may be either prokaryotic or eukaryotic in origin. The inserted gene may contain its
own translation start signals, for example, a ribosomal binding site and start (AUG) codon, or
it may be inserted in a manner which takes advantage of one or more of these components
preexisting in the viral RNA to be modified. Certain structural constraints must be observed to
preserve correct translation of the inserted sequence, according to principles well understood in
the art. For example, if it is intended that the exogenous coding segment is to be combined with
an endogenous coding segment, the coding sequence to be inserted must be inserted in reading
frame phase therewith and in the same translational direction.
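
The reading-frame constraint just described can be expressed as a simple check. The sketch below is a minimal illustration, not a procedure from this disclosure: the function name, the toy sequences, and the use of DNA letters (as in a cDNA clone) are assumptions. It tests that the insert has the same orientation, begins on a codon boundary of the endogenous coding segment, has a length divisible by three, and introduces no premature stop codon.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def is_in_frame_fusion(host_orf, insert_pos, insert, same_orientation=True):
    """Check the constraints for fusing `insert` into `host_orf`.

    host_orf   -- endogenous coding sequence (DNA letters), starting at ATG
    insert_pos -- 0-based offset in host_orf where the insert is placed
    insert     -- exogenous coding sequence to be inserted
    """
    if not same_orientation:
        return False  # must be read in the same translational direction
    if insert_pos % 3 != 0:
        return False  # insert would begin in the middle of a codon
    if len(insert) % 3 != 0:
        return False  # downstream endogenous codons would be shifted
    fused = host_orf[:insert_pos] + insert + host_orf[insert_pos:]
    codons = [fused[i:i + 3] for i in range(0, len(fused) - 2, 3)]
    # No stop codon may occur before the final codon of the fusion.
    return not any(codon in STOP_CODONS for codon in codons[:-1])


host = "ATGGCTAAATAA"  # toy ORF: ATG GCT AAA TAA
print(is_in_frame_fusion(host, 6, "GGATCC"))   # in frame
print(is_in_frame_fusion(host, 6, "GGATC"))    # length not a multiple of 3
print(is_in_frame_fusion(host, 4, "GGATCC"))   # starts mid-codon
```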

It will be understood by those ordinarily skilled in the art that there may exist certain
genes whose transfer does not result in obvious phenotypic modification of the recipient cell.
Such may occur, for example, if the translation product of the non-viral gene is toxic to the host
cell, is degraded or processed in a manner which renders it non-functional or possesses structural
features which render it impossible for the host cell to translate in sufficient quantities to confer
a detectable phenotype on the transformed cells. However, the invention does not depend upon
any specific property of an RNA segment or gene being transferred. Therefore, the possible
existence of RNA segments or genes which fail to confer a readily observable phenotypic trait
on recipient cells or plants is irrelevant to the invention, and in any case will be readily
recognizable by those of ordinary skill in the art without undue experimentation.

An exogenous RNA segment may be inserted at any convenient insertion site in any of
the cDNA sequences corresponding to a viral RNA, or component RNA of a multipartite RNA
virus, provided the insertion does not disrupt a sequence essential for replication of the RNA
within the host cell. For example, for a virus whose coat protein is not essential for replication,
an exogenous RNA segment may be inserted within or substituted for the region which normally
codes for coat protein. As desired, regions which contribute to undesirable host cell responses
may be deleted or inactivated, provided such changes do not adversely affect the ability of the
RNA to be replicated in the host cell. For many single component and multipartite RNA viruses,
a reduction in the rate of normal RNA replication is tolerable and will in some instances be
preferred, since the amount of RNA produced in a normal infection is more than enough to
saturate the ribosomes of the transformed cell.

Plant cells which are inoculated in culture will normally remain transfected as the cells
grow and divide since the RNA components expressed from the DNA-launching platform are
able to replicate and thus become distributed to descendant cells upon cell division. Plants
regenerated from phenotypically modified cells, tissues, or protoplasts remain phenotypically
modified. Similarly, plants transfected as seedlings remain transfected during growth. Optimal
timing of application of the transfecting components will be governed by the result which is
intended and by variations in susceptibility to the transfecting components during various stages
of plant growth.

Many plant RNA viruses are seed transmitted from one generation to the next. This 
property can be exploited to effect genotypic transformation of a plant. That is to say, the 
modified RNA remains transmissible from one generation to the next, just as seed-borne virus 
infections are transmitted from one generation to the next.

Following are examples which illustrate procedures for practicing the invention. These 
examples should not be construed as limiting. All percentages are by weight and all solvent 
mixture proportions are by volume unless otherwise noted. 



Example 1 - Construction of Agrobacterium Vectors

Binary vectors for expressing the BMV 1a and 2a proteins in plants were constructed.
Starting with the pBI101.2 construct (Clontech, Palo Alto, CA), the GUS gene was removed by
first cutting the construct with EcoRI and SnaBI. The overhanging restriction fragment ends
were filled in by treatment with Klenow fragment and dNTPs. The restriction fragment ends
were religated, forming pBI101.2LR1.

The 2a expression cassette was inserted into pBI101.2LR1. First, pBI101.2LR1 was
cut with HindIII and dephosphorylated. Next, pB2PA17 (Dinant et al., 1993) was cut with
HindIII and the 2a insert was purified using a low melting agarose gel. The restriction fragment ends
were ligated, forming pB2LR4 and pB2LR5 (Figures 3c and 3d).

The 1a expression cassette was inserted into pBI101.2LR1 by first cutting pBI101.2LR1
with SnaBI and dephosphorylating it. pB1PA17 (Dinant et al., 1993) was cut with PstI and the
extra nucleotides were removed with T4 DNA polymerase. The 1a insert was purified using a
low melting agarose gel. The restriction fragment ends were ligated, forming the pB1LR2 and
pB1LR3 vectors (Figures 3a and 3b).

The 1a expression cassette was inserted into pB2LR4 and pB2LR5 by cutting pB2LR4
or pB2LR5 with SnaBI and dephosphorylating it. pB1PA17 (Dinant et al., 1993) was cut with PstI,
and the extra nucleotides were removed with T4 DNA polymerase. The 1a insert was purified
using a low melting agarose gel and ligated with the cut pB2LR4 or pB2LR5 vectors to form the
pB12LR6, pB12LR7, pB12LR8, and pB12LR9 vectors (Figures 3e-3h).

Example 2 - Construction of DNA-launching Platform for wtRNA3 of BMV and for RNA
Derivatives Containing Foreign Sequences

Vector pRT101 (Topfer et al., 1987) was cut with PpuMI and the restriction fragment
ends were filled in with Klenow fragment and dNTPs. The vector was then cut with BamHI and
dephosphorylated. Vector pB3RQ39 (Ishikawa et al., 1997) was cut with SnaBI and BamHI;
the B3 fragment was isolated from a low melting agarose gel. This fragment was ligated to the
cut pRT101, thereby forming pB3LR10 (Figure 4). pB3LR15 (Figure 4), a pB3LR10
derivative, has the ClaI-KpnI fragment replaced with the corresponding fragment from pB3TP8
(Janda et al., 1987).

PCR was performed on pRT101 to amplify an EcoRV and EcoRI fragment. To create
a StuI site instead of a PpuMI site, a one nucleotide deletion was performed during the PCR
process. The resulting PCR product was cut with EcoRV and EcoRI and inserted into
dephosphorylated pRT101 cut with EcoRV and EcoRI to form pRT101LR11. The pRT101LR11
was cut with StuI and BamHI and dephosphorylated. pB3RQ39 was cut with SnaBI and BamHI
and a B3 fragment was isolated using a low melting agarose gel. The fragment was then ligated
to pRT101LR11 to form pB3LR12 (Figure 4).

Another DNA-launching platform was constructed with wtRNA3 of BMV having a
partially doubled CaMV 35S promoter, thereby forming pB3LR14 and pB3LR16 (Figure 4).

A DNA-launching platform wherein the BMV RNA3 coat protein was replaced with
GUS was also constructed. The pB3MI22 (Ishikawa et al., 1997) was cut with ClaI and StuI and
a B3GUS insert was isolated. The pB3LR10 or pB3LR14 DNA-launching constructs were cut
with ClaI and StuI and dephosphorylated. The B3GUS fragment was then ligated to the cut
pB3LR10 or pB3LR14, thereby forming the pB3GUSLR17 and pB3GUSLR18 DNA-launching
constructs (Figure 5).

A DNA-launching platform having a BMV RNA3 with a GUS gene insertion wherein
the GUS is downstream of an additional BMV subgenomic promoter was constructed. The
pB3LR15 construct was cut with AvaI and the restriction fragment ends were filled in with
Klenow fragment and dNTPs. The construct was then cut with ClaI and dephosphorylated. The
pB3MI22 was cut with ClaI and StuI and a B3GUS fragment was isolated. The isolated B3GUS
fragment was then ligated to the cut pB3LR15 construct to form a new construct,
pB3GUSCPLR19 (Figure 5).

A BMV RNA3 based DNA-launching platform with a CP gene inserted downstream of
an additional cowpea chlorotic mottle virus (CCMV) subgenomic promoter was constructed.
The pB3GUSLR17 construct was cut with StuI and KpnI and dephosphorylated. The pBC3AJ14
(Pacha and Ahlquist, 1991) was cut with NdeI, the ends were blunted by methods known in the
art, and then cut with KpnI. A coat protein fragment was then isolated. The coat protein
fragment was then ligated to the cut pB3GUSLR17 to form a new construct, pB3GUSCPLR22
(Figure 5).

A DNA-launching platform was constructed having a subgenomic RNA4. The pB4MK2
(M. Kroll, personal communications) was cut with SnaBI and BamHI and an RNA4 fragment was
then isolated. The pRT101LR11 construct was cut with StuI and BamHI and dephosphorylated.
The fragment and the cut pRT101LR11 construct were then ligated, forming pB4LR20 (Figure
5a).

A DNA-launching platform wherein the BMV coat protein was replaced with GFP was
constructed. pEGFP (Clontech, CA) was cut with NotI, filled in with Klenow fragment and
dNTPs, and cut with SalI, and the GFP insert was isolated using a low-melting agarose gel. The
pB3LR15 was cut with SalI and StuI and dephosphorylated. The GFP fragment was then ligated
to the cut pB3LR15, thereby forming pB3GFPLR48 (Figure 6e).

A DNA-launching platform having a BMV RNA3 with a GFP gene insertion wherein
the CP is downstream of an additional CCMV subgenomic promoter was constructed. The
pBC3AJ14 (Pacha and Ahlquist, 1991) was cut with NdeI and EcoRI and the ends were blunted
by methods known in the art. The coat protein fragment was then isolated and ligated into
dephosphorylated and blunted pEGFP cut with NotI and StuI, forming pEGFPCPLR49.
pEGFPCPLR49 was cut with KpnI and the EGFPCP fragment was isolated using a low-melting
agarose gel. pB3GFPLR48 was cut with KpnI and dephosphorylated. The EGFPCP fragment
was then ligated to the cut pB3GFPLR48, thereby forming pB3GFPCPLR50 (Figure 6a).

An RNA transcription vector wherein the GFP gene is expressed as a translational fusion
with BMV 3a was constructed. The pB3TP10 (Pacha and Ahlquist, 1991) was cut with BamHI
and dephosphorylated. The GFP fragment was amplified from pEGFP (Clontech, CA) using
PCR and the following primers:

5'GCAGTCGACGGTACCGCGGGCC3'
and
5'CGCGGCCGCGGATCCTGTACAGCTCG3'.

The amplified product was cut with BamHI and purified using a low-melting agarose gel. The
GFP fragment was ligated to the cut pB3TP10, forming pB3GFPLR47 (Figure 6d). The
pB3GFPLR47 was cut with EcoRI and transcribed using T7 RNA polymerase.
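
As a quick consistency check, the two primer sequences given above can be scanned for restriction recognition sequences. The sketch below is illustrative: the choice of enzymes to scan for and the function name are assumptions made here, while the recognition sequences themselves are the standard ones for these enzymes. It reports the 0-based position of each site found in a primer.

```python
# Standard recognition sequences; the selection of enzymes is an assumption.
SITES = {
    "SalI": "GTCGAC",
    "KpnI": "GGTACC",
    "NotI": "GCGGCCGC",
    "BamHI": "GGATCC",
}

PRIMER_1 = "GCAGTCGACGGTACCGCGGGCC"
PRIMER_2 = "CGCGGCCGCGGATCCTGTACAGCTCG"


def find_sites(seq):
    """Return {enzyme: 0-based index} for each recognition site present in seq."""
    return {name: seq.find(site) for name, site in SITES.items() if site in seq}


print(find_sites(PRIMER_1))  # {'SalI': 3, 'KpnI': 9}
print(find_sites(PRIMER_2))  # {'NotI': 1, 'BamHI': 9}
```

Of the four enzymes scanned, only the second primer carries a BamHI recognition sequence within the primer itself.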

An Agrobacterium vector containing a BMV RNA3 DNA-launching platform was
constructed. The pBI101.2LR1 was cut with SmaI and dephosphorylated. The pB3LR15 was
cut with PvuII and the B3 fragment was purified using a low-melting agarose gel. The B3
fragment was then ligated to the cut pBI101.2LR1, thereby forming pB3LR42 (Figure 9).

A DNA-launching platform wherein the BMV RNA3 coat protein was replaced with the
SHMV (Sunn hemp mosaic virus) coat protein and the GUS gene was inserted downstream of
an additional BMV subgenomic promoter was constructed. The pB3RS4 (Sacher et al., 1988)
was cut with AvaI, blunted with Klenow fragment and dNTPs, and cut with KpnI. The SHMV
coat protein fragment was isolated using a low-melting agarose gel. The pB3GUSLR17 was cut
with StuI and KpnI and dephosphorylated. The SHMV coat protein fragment was ligated to the
cut pB3GUSLR17, thereby forming pB3GUSCPLR24 (Figure 7).

Other permutations of DNA-launching platforms containing one or more foreign genes 
and the necessary cis-acting replication signals will be readily appreciated in view of the 
teachings herein. For examples, see Figures 5-10.

Example 3 - Transfection of N. tabacum Protoplasts with DNA-launching Platform
Media: 

NT1 Medium (1 liter) was made with MS salt (Gibco-BRL, catalog #11118-031), 3 ml
of 6% KH2PO4, and 0.2 µg/ml 2,4-D (final concentration). The pH was adjusted to 5.5-5.7 using
KOH, and the resulting mixture was autoclaved.

NT1 Plating Medium (1 liter) was made with NT1 medium and 72.86 g mannitol, the
pH was adjusted to 5.5-5.7, and the resulting mixture was autoclaved.

Wash Solution (1 liter) was made with 72.86 g mannitol, the pH was adjusted to 5.5, and
the resulting mixture was autoclaved.

Electroporation Buffer was made with 0.8% NaCl, 0.02% KCl, 0.02% KH2PO4, 0.11%
Na2HPO4, and 0.4 M mannitol. The pH was adjusted to 6.5, and the resulting mixture was
autoclaved.

Enzyme Solution was made with 0.4 M mannitol and 20 mM MES. The pH was adjusted
to 5.5, and the resulting mixture was autoclaved.
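
The media recipes above mix mass-per-liter, percent, and molar units. The following sketch shows how they relate; it is a convenience calculation, not part of the described protocol. The helper names are hypothetical, percentages are assumed to be weight/volume, and the molar mass of mannitol (182.17 g/mol) is a standard value not stated in the text. It shows that 72.86 g mannitol per liter corresponds to about 0.4 M, the concentration specified for the electroporation buffer and enzyme solution.

```python
MANNITOL_G_PER_MOL = 182.17  # molar mass of D-mannitol (standard value)


def molarity(grams, liters, g_per_mol):
    """Molar concentration from a weighed mass dissolved in a given volume."""
    return grams / g_per_mol / liters


def percent_wv_to_g_per_l(percent):
    """Convert % (w/v), i.e. g per 100 ml, to g per liter."""
    return percent * 10.0


# Mannitol in the wash solution and NT1 plating medium
print(round(molarity(72.86, 1.0, MANNITOL_G_PER_MOL), 3))

# Electroporation buffer salts, assuming the percentages are w/v
for salt, pct in [("NaCl", 0.8), ("KCl", 0.02), ("KH2PO4", 0.02), ("Na2HPO4", 0.11)]:
    print(salt, round(percent_wv_to_g_per_l(pct), 2), "g/l")
```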

Growth conditions: Cells (Nicotiana tabacum) were grown at room temperature in NT1
media with constant shaking (about 200 rpm).

Preparation of cultures for digestion: About 2-3 ml of one-week old suspension culture
was subcultured into 50 ml of fresh NT1 media 3 days before the enzyme digestion. The culture
was maintained at 28°C under constant shaking.

Enzyme digestion: The enzyme digestion solution was prepared containing the
following: 1% cellulysin (Calbiochem) and 0.3% macerase (Calbiochem) in the enzyme
solution. The pH was adjusted to 5.5 and the solution was filter sterilized.

The cells were centrifuged at 800 rpm for 5 min. The supernatant was discarded. About
40 ml of wash solution was added, and the cells were resuspended and centrifuged at 800 rpm for
5 min. The supernatant was discarded. The cells were then resuspended in three volumes of
enzyme digestion solution and incubated for 60 min. at room temperature.

Washing: The cells were transferred into a 50 ml plastic tube and centrifuged at 800 rpm
for 5 min. The supernatant was discarded. The cells were resuspended in 40 ml of wash
solution and centrifuged at 800 rpm for 5 min. The supernatant was discarded. The cells were
resuspended in 40 ml of electroporation buffer and centrifuged at 800 rpm for 5 min. The
supernatant was discarded. The cells were resuspended in four volumes of electroporation
buffer.

Electroporation: One ml of cells containing the RNA or DNA inocula was transferred
into electroporation cuvettes and placed on ice for 10 min. The cells were then mixed and
electroporated at 500 µF, 250 V. The cuvettes were placed on ice for 10 min. The cells were
transferred into 10 ml of NT1 plating media.

Incubation and collection of samples: The cells were incubated at room temperature in
the dark. Samples were collected 24-48 hrs post inoculation.

RNA Analysis: RNA extraction, denaturing 1% agarose gel electrophoresis and Northern
blot hybridization were performed by known methods, such as that performed in Rasochova and
Miller (1996). Each lane was loaded with equal amounts (approx. 5 µg) of total RNA as
determined by spectrophotometry and confirmed by ethidium bromide staining of ribosomal
RNA before Northern blot hybridization. 1 x 10^6 cpm/ml of radioactive probe in hybridization
buffer was used per hybridization experiment. Replication of RNA3 was confirmed by detection
of sgRNA4, thus showing that BMV RNA replication factors 1a and 2a expressed from
expression plasmid(s) support efficient replication of RNA3 supplied as in vitro transcript
(Figure 11) as well as launched from DNA-launching platform (Figure 12).
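
Loading approximately 5 µg of total RNA per lane, as determined by spectrophotometry, amounts to a short calculation. The sketch below is a hedged illustration: it relies on the common approximation that an absorbance of 1.0 at 260 nm corresponds to about 40 µg/ml of RNA, and the absorbance reading and dilution factor in the example are hypothetical values, not data from these experiments.

```python
RNA_UG_PER_ML_PER_A260 = 40.0  # common approximation for single-stranded RNA


def rna_conc_ug_per_ul(a260, dilution_factor):
    """Concentration of the undiluted RNA stock in ug/ul from an A260 reading."""
    return a260 * RNA_UG_PER_ML_PER_A260 * dilution_factor / 1000.0


def load_volume_ul(target_ug, conc_ug_per_ul):
    """Volume of stock needed to load `target_ug` of RNA in one lane."""
    return target_ug / conc_ug_per_ul


# Hypothetical reading: A260 = 0.25 measured on a 100-fold dilution
conc = rna_conc_ug_per_ul(0.25, 100)
print(conc, "ug/ul")
print(load_volume_ul(5.0, conc), "ul per lane")
```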

Example 4 - Production of Transgenic N. tabacum Plants

Once a desired molecule was constructed in E. coli, the molecule was transferred into
Agrobacterium tumefaciens by the freeze-thaw method. Vectors pB1LR2, pB2LR4, pB12LR6,
and pB12LR7 were all individually used. An Agrobacterium strain LBA 4404 containing an
appropriate helper Ti plasmid was grown in 5 ml of YEP medium overnight at 28°C. Two ml
of the overnight culture were added to 50 ml YEP medium in a 250-ml flask and shaken
vigorously (250 rpm) at 28°C until the culture grew to an OD500 of 0.5 to 1.0. The culture was
chilled on ice. The cell suspension was centrifuged at 3000 g for 5 min. at 4°C. The supernatant
solution was discarded. The cells were resuspended in 1 ml of ice-cold 20 mM CaCl2 solution.
0.1-ml aliquots were dispensed into prechilled eppendorf tubes. About 1 µg of plasmid DNA
was added to the cells. The cells were frozen in liquid nitrogen. The cells were thawed by
incubating the test tube in a 37°C water bath for 5 min. 1 ml of YEP medium was added to the
tube, which was incubated at 28°C for 2-4 h with gentle shaking to allow the bacteria to express the
antibiotic resistance genes. The tubes were centrifuged for 30 s and the supernatant solution was
discarded. The cells were resuspended in 0.1 ml YEP medium, plated on a YEP agar plate
containing selection antibiotic(s), and incubated at 28°C. Transformed colonies appeared in 2-3
days.

In vitro clonal copies of approximately three week old Nicotiana tabacum, Wisconsin No.
38, were used as the source of explants. Leaf explants were prepared from the second and third
fully expanded leaves of in vitro cultures. The leaf pieces were cut into 1 cm x 1 cm squares and
placed upon TB1 (plus 2.0 mg/l 6-benzyl-aminopurine and 0.1 mg/l α-naphthalene acetic acid)
media for 24 hours at 25°C with a 16 hour photo period.

Agrobacterium tumefaciens strain LBA 4404 containing the preselected binary vector
was used for plant transformation. Explants were placed in ~10 ml of overnight grown
Agrobacterium culture for 30 min. Leaf explants were then blotted on filter paper and placed
on TB2 (plus 1.0 mg/l 6-benzyl-aminopurine and 0.1 mg/l α-naphthalene acetic acid) media for
4 days, abaxial side down. Explants were then rinsed three times in sterile water, blotted on filter
paper, and placed on TB2 media for regeneration with 100 mg/l kanamycin and 400 mg/l
carbenicillin at 25°C, 16 hour photo period, abaxial side down. Explants were transferred to
fresh TB2 media with 100 mg/l kanamycin and 400 mg/l carbenicillin every 10 to 14 days until
plantlets developed. Plantlets typically developed at 10-14 days. Plantlets were cut from the
callus and placed on MST media containing 100 mg/l kanamycin and 400 mg/l carbenicillin to
induce rooting. Rooted plants were transferred to soil.

TB1 (1 liter) included 4.30 g MS salts, 100 mg myo-inositol, 1.0 ml Nitsch and Nitsch
vitamins, 30 g sucrose, 2 mg BAP, 0.10 mg NAA, and 8 g Noble agar. The media was
adjusted to pH 5.7 and autoclaved.

TB2 (1 liter) included 4.30 g MS salts, 100 mg myo-inositol, 1.0 ml Nitsch and Nitsch
vitamins, 30 g sucrose, 1.0 mg BAP, 0.10 mg NAA, and 8 g Noble agar. The media was
adjusted to pH 5.7 and autoclaved.

MST (1 liter) included 4.30 g MS salts, 1.0 ml Nitsch and Nitsch vitamins, 30 g sucrose,
100 mg myo-inositol, and 8.5 g Difco agar. The media was adjusted to pH 5.7 and autoclaved.

YEP (100 ml) included 1.0 g Bacto-peptone, 1.0 g Bacto-yeast extract, and 0.5 g NaCl.
The media was autoclaved.

RNA Analysis: Total RNA extraction, denaturing 1% agarose gel electrophoresis and
Northern blot hybridization were performed by known methods, such as that performed in
Rasochova and Miller (1996). Each lane was loaded with equal amounts (approx. 5 µg) of total
RNA as determined by spectrophotometry and confirmed by ethidium bromide staining of
ribosomal RNA before Northern blot hybridization. 1 x 10^6 cpm/ml of radioactive probe in
hybridization buffer was used per hybridization experiment. Figure 13a shows the successful
expression of BMV 1a and 2a mRNA in transgenic N. tabacum.

Example 5 - Transfection of Transgenic N. tabacum Plants with DNA-launching Platform

Precipitation of DNA onto Microcarriers for Particle Bombardment: (Kikkert, 1993).

Sterilization of Microcarriers: 80 mg of gold microcarriers were resuspended in 1 ml
of 70% ethanol, soaked for 15 min., and centrifuged at 13,000 x g for 5 min. The supernatant
was carefully removed and discarded. Particles were resuspended in 1 ml of sterile distilled,
deionized water and centrifuged at 13,000 x g for 5 min. The supernatant was carefully removed
and discarded. Water washing of particles was repeated 2 more times. After the final rinse,
particles were resuspended in 1 ml of sterile 50% glycerol.

Coating Microcarriers with DNA: The following was sequentially and quickly added:
5 µl DNA (1 µg/µl), 50 µl of 2.5 M CaCl2, and 20 µl of 0.1 M spermidine.

The mixture was incubated for 10 min. on a vortex shaker at room temperature.
Particles were pelleted by centrifugation at 13,000 x g for 5 sec. Supernatant was carefully
removed and discarded. Particles were resuspended in 140 µl of 70% ethanol and centrifuged
at 13,000 x g for 5 sec. Supernatant was removed and discarded. Particles were resuspended
in 140 µl of 100% ethanol and centrifuged at 13,000 x g for 5 sec. Supernatant was removed and
discarded. Particles were resuspended in 50 µl of 100% ethanol.

Young leaves from tobacco plants grown in vitro on agar-solidified MS medium
containing 30 g/liter sucrose were bombarded with 5-µl aliquots of resuspended DNA-coated
particles using a PDS-1000/He biolistic gun (DuPont) and 1100 psi rupture disks (Bio-Rad).
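
The amounts delivered per bombardment follow from the volumes in this example. The sketch below is a rough estimate under stated assumptions: a 50 µl aliquot of the glycerol particle stock is coated (the volume given in the parallel protocol of Example 7), all DNA binds to the particles, and nothing is lost during the washes. The function name and its parameters are hypothetical. It computes the gold and DNA carried by each 5-µl aliquot taken from the final 50 µl ethanol suspension.

```python
def per_shot_amounts(gold_mg=80.0, stock_ml=1.0, aliquot_ul=50.0,
                     dna_ul=5.0, dna_ug_per_ul=1.0,
                     final_ul=50.0, shot_ul=5.0):
    """Upper-bound estimate of gold (mg) and DNA (ug) per bombardment.

    Assumes complete DNA binding and no particle loss during washing.
    """
    gold_in_aliquot_mg = gold_mg / (stock_ml * 1000.0) * aliquot_ul
    dna_total_ug = dna_ul * dna_ug_per_ul
    fraction_per_shot = shot_ul / final_ul
    return gold_in_aliquot_mg * fraction_per_shot, dna_total_ug * fraction_per_shot


gold_mg, dna_ug = per_shot_amounts()
print(round(gold_mg, 3), "mg gold per shot")
print(round(dna_ug, 3), "ug DNA per shot")
```

With the default values this gives about 0.4 mg of gold and at most 0.5 µg of DNA per shot; actual delivery would be lower to the extent that material is lost in the wash steps.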

RNA Analysis: Total RNA extraction, denaturing 1% agarose gel electrophoresis and
Northern blot hybridization were performed by known methods, such as that performed in
Rasochova and Miller (1996). Each lane was loaded with equal amounts (approx. 5 µg) of total
RNA as determined by spectrophotometry and confirmed by ethidium bromide staining of
ribosomal RNA before Northern blot hybridization. 1 x 10^6 cpm/ml of radioactive probe in
hybridization buffer was used per hybridization experiment. Figure 14a shows that the launched
BMV RNA3 replicates efficiently in transgenic plants expressing BMV replication factors 1a
and 2a and that the launched RNA3 is unable to replicate in the absence of BMV 1a and/or 2a.

Example 6 - Production of Transgenic N. benthamiana Plants

Once a desired molecule was constructed in E. coli, the molecule was transferred into
Agrobacterium tumefaciens. Vectors pB1LR2, pB2LR4, pB12LR6, and pB12LR7 were all
individually used. An Agrobacterium strain LBA 4404 containing an appropriate helper Ti
plasmid was grown in 5 ml of YEP medium overnight at 28°C. Two ml of the overnight culture
were added to 50 ml YEP medium in a 250-ml flask and shaken vigorously (250 rpm) at 28°C
until the culture grew to an OD500 of 0.5 to 1.0. The culture was chilled on ice. The cell
suspension was centrifuged at 3000 g for 5 min. at 4°C. The supernatant solution was discarded.
The cells were resuspended in 1 ml of ice-cold 20 mM CaCl2 solution. 0.1-ml aliquots were
dispensed into prechilled eppendorf tubes. About 1 µg of plasmid DNA was added to the cells.
The cells were frozen in liquid nitrogen. The cells were thawed by incubating the test tube in
a 37°C water bath for 5 min. 1 ml of YEP medium was added to the tube, which was incubated at 28°C
for 2-4 h with gentle shaking to allow the bacteria to express the antibiotic resistance genes. The
tubes were centrifuged for 30 s and the supernatant solution was discarded. The cells were
resuspended in 0.1 ml YEP medium. The cells were plated on a YEP agar plate containing
selection antibiotic(s) and incubated at 28°C. Transformed colonies appeared in 2-3 days.

In vitro clonal copies of approximately five to seven week old N. benthamiana were used
as the source of explants. Leaf explants were prepared from the second and third fully expanded
leaves of in vitro cultures. The leaf pieces were cut into 1 cm x 1 cm squares and placed upon
MS104 media in 100 x 15 mm plates for 24 hours at 23°C with a 16 hour photo period.

Agrobacterium tumefaciens strain LBA 4404 containing the preselected binary vector
was used. Explants were placed in ~10 ml of overnight grown Agrobacterium culture for 30 min.
Leaf explants were then blotted on filter paper and placed abaxial side down on MS104 media
for 4 days. Explants were then rinsed three times in sterile water, blotted on filter paper, and
placed on MS104 media for regeneration with 300 mg/L kanamycin and 400 mg/L carbenicillin.
Explants were transferred to fresh MS104 media with 300 mg/L kanamycin and 400 mg/L
carbenicillin every 10-14 days until plantlets developed. Plantlets typically developed at 31-50
days. Plantlets were cut from the callus and placed on MST media plus 300 mg/L kanamycin
and 400 mg/L carbenicillin to induce rooting. Rooted plants were transferred to soil.

One liter of MS104 included 4.3 g MS salt mixture, 1.0 ml B5 vitamin solution, 30 g
sucrose, 1.0 mg BA, 0.1 mg NAA, and 8.0 g Phytagar. The media was adjusted to pH 5.8 and
autoclaved.

100 ml of YEP included 1.0 g Bacto-peptone, 1.0 g Bacto-yeast extract, and 0.5 g NaCl. The
media was autoclaved.

One liter of MST included 4.3 g MS salt mixture, 1.0 ml Nitsch & Nitsch vitamins, 30
g sucrose, 100 mg myo-inositol, and 8.5 g Phytagar. The media was adjusted to pH 5.7 and
autoclaved.

RNA Analysis: Total RNA extraction, denaturing 1% agarose gel electrophoresis and
Northern blot hybridization were performed by known methods, such as those performed in
Rasochova and Miller (1996). Each lane was loaded with equal amounts (approx. 5 µg) of total
RNA as determined by spectrophotometry and confirmed by ethidium bromide staining of
ribosomal RNA before Northern blot hybridization. 1 x 10^6 cpm/ml of radioactive probe in
hybridization buffer was used per hybridization experiment. Figure 13b shows the successful
expression of BMV 1a and 2a mRNA in transgenic N. benthamiana.
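Equal loading of approximately 5 µg per lane, as determined by spectrophotometry, reduces to a short calculation. The sketch below assumes the common convention that one A260 unit corresponds to about 40 µg/ml of single-stranded RNA in a 1 cm path; the sample names, absorbance readings, and 1:100 dilution are hypothetical and do not come from the source.

```python
def rna_conc_ug_per_ul(a260, dilution_factor):
    """Estimate RNA concentration from A260 (assumes 1 A260 unit ~ 40 ug/ml, 1 cm path)."""
    return a260 * 40.0 * dilution_factor / 1000.0


def load_volume_ul(target_ug, conc_ug_per_ul):
    """Volume of sample needed to load the target mass of RNA."""
    if conc_ug_per_ul <= 0:
        raise ValueError("concentration must be positive")
    return target_ug / conc_ug_per_ul


# Hypothetical A260 readings taken on 1:100 dilutions of each RNA prep.
samples = {"plant_A": 0.25, "plant_B": 0.40}
volumes = {}
for name, a260 in samples.items():
    conc = rna_conc_ug_per_ul(a260, 100)
    volumes[name] = load_volume_ul(5.0, conc)
    print(f"{name}: {conc:.2f} ug/ul, load {volumes[name]:.2f} ul for 5 ug")
```

Ethidium bromide staining of ribosomal RNA, as described in the text, remains the check that the calculated volumes actually gave equal loading.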

Example 7 - Transfection of Transgenic N. benthamiana Plants

Precipitation of DNA onto Microcarriers for Particle Bombardment: (From Kikkert
(1993) "The biolistic PDS-1000/He device", Plant Cell Tiss. and Org. Cult. 33:221-226)

Sterilization of Microcarriers: 80 mg of gold microcarriers were resuspended in 1 ml
of 70% ethanol, soaked for 15 min., and centrifuged at 13,000 x g for 5 min. The supernatant
was carefully removed and discarded. Particles were resuspended in 1 ml of sterile distilled,
deionized water and centrifuged at 13,000 x g for 5 min. The supernatant was carefully removed
and discarded. Water washing of particles was repeated 2 more times. After the final rinse,
particles were resuspended in 1 ml of sterile 50% glycerol.

Coating Microcarriers with DNA: To 50 µl of particles the following were
sequentially and quickly added: 5 µl DNA (1 µg/µl), 50 µl of 2.5 M CaCl2, and 20 µl of 0.1 M
spermidine.

The mixture was incubated for 10 min. on a vortex shaker at room temperature.
Particles were pelleted by centrifugation at 13,000 x g for 5 sec. Supernatant was carefully
removed and discarded. Particles were resuspended in 140 µl of 70% ethanol and centrifuged
at 13,000 x g for 5 sec. Supernatant was removed and discarded. Particles were resuspended
in 140 µl of 100% ethanol and centrifuged at 13,000 x g for 5 sec. Supernatant was removed and
discarded. Particles were resuspended in 50 µl of 100% ethanol.

Young leaves from N. benthamiana plants grown in vitro on agar-solidified MS medium
containing 30 g/liter sucrose were bombarded with 5 µl aliquots of resuspended DNA-coated
particles using a PDS-1000/He biolistic gun (DuPont) and 1100 psi rupture disks (Bio-Rad).
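The quantities in the coating and bombardment steps can be checked with simple arithmetic. The sketch below assumes the volumes and concentrations as read from the text (50 µl particles from an 80 mg/ml stock, 5 µl DNA at 1 µg/µl, 50 µl of 2.5 M CaCl2, 20 µl of 0.1 M spermidine, final resuspension in 50 µl, 5 µl per shot); the variable names are hypothetical, and the per-shot DNA figure is only an upper bound because it assumes all DNA binds to the particles.

```python
# Stock and volumes as read from the text.
GOLD_STOCK_MG_PER_UL = 80.0 / 1000.0  # 80 mg gold resuspended in 1 ml 50% glycerol
components_ul = {"particles": 50.0, "DNA": 5.0, "CaCl2": 50.0, "spermidine": 20.0}

total_ul = sum(components_ul.values())

# Final concentrations in the coating mixture (C1 * V1 = C2 * V2).
cacl2_final_M = 2.5 * components_ul["CaCl2"] / total_ul
spermidine_final_mM = 0.1 * 1000.0 * components_ul["spermidine"] / total_ul

# Amounts of gold and DNA in one coating reaction.
gold_mg = components_ul["particles"] * GOLD_STOCK_MG_PER_UL
dna_ug = components_ul["DNA"] * 1.0  # 1 ug/ul

# The pellet is resuspended in 50 ul ethanol and used in 5 ul aliquots.
shots = 50.0 / 5.0
gold_per_shot_mg = gold_mg / shots
dna_per_shot_ug_max = dna_ug / shots  # upper bound: assumes complete binding

print(f"total volume: {total_ul} ul")
print(f"CaCl2: {cacl2_final_M:.2f} M, spermidine: {spermidine_final_mM:.1f} mM")
print(f"per shot: {gold_per_shot_mg:.2f} mg gold, <= {dna_per_shot_ug_max:.2f} ug DNA")
```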

RNA Analysis: Total RNA extraction, denaturing 1% agarose gel electrophoresis and
Northern blot hybridization were performed by known methods, such as those performed in
Rasochova and Miller (1996). Each lane was loaded with equal amounts (approx. 5 µg) of total
RNA as determined by spectrophotometry and confirmed by ethidium bromide staining of
ribosomal RNA before Northern blot hybridization. 1 x 10^6 cpm/ml of radioactive probe in
hybridization buffer was used per hybridization experiment. The launched BMV RNA3
showed efficient replication (Figure 14b) in transgenic N. benthamiana plants expressing BMV
replication factors 1a and 2a and was unable to replicate in the absence of BMV 1a and/or 2a.

Example 8 - Transfection of Transgenic Plants with GUS Containing DNA-Launching Platform

Transgenic N. tabacum and N. benthamiana plants were produced according to the
procedures discussed above. The plants were transfected with a DNA-launching platform
containing a GUS gene (Figure 5a) by particle bombardment as described in Examples 5 and 7.
The plants were incubated for 3-5 days and then assayed for β-glucuronidase (GUS) activity
using 1 mg/ml X-Gluc (5-bromo-4-chloro-3-indolyl glucuronide) as substrate in 0.1 M
potassium phosphate buffer, pH 7.0, 50 µM potassium ferrocyanide, and 2% Triton® X-100.
Following an overnight incubation at 37 °C, cells replicating launched RNA3 derivatives and
expressing the GUS reporter gene from a subgenomic RNA4 gave rise to blue spots (Figure 15).
The launched RNA3 derivative did not replicate and express the GUS reporter gene in the absence
of BMV RNA replication factors 1a and 2a (e.g., in wt N. benthamiana and in wt N. tabacum).

Example 9 - Transfection of Transgenic Plants Expressing BMV 1a, 2a, 3a, and CP

A plant is transformed with BMV 1a, 2a, 3a, and CP genes whereby those genes are
stably expressed in said plant. This can be done with the procedures outlined above. Any
modifications that would be needed would be readily apparent to those skilled in the art in light
of the teachings contained herein. A DNA-launching platform encoding an RNA replicon which
contains a foreign gene and necessary BMV or CCMV cis-acting replication signals to replicate
said replicon is constructed (Figure 10b). Foreign genes to be included in said replicon could
include, for example, a Bacillus thuringiensis polynucleotide that codes for a B.t. protein. Other
sequences would include, e.g., sequences that encode herbicide resistance, or any other known
sequence that encodes peptides or proteins having desired qualities in plants.

Alternatively, plants can be transformed to express BMV 1a, 2a, 3a, and a TMV coat
protein in place of the BMV coat protein. A DNA-launching platform is then made containing
one or more foreign genes and the necessary cis-acting replication signals, either BMV or
CCMV, and a TMV origin of assembly (Figures 8a, 8b, and 10a). This launching platform
provides a distinct advantage, as TMV is a rod-shaped virus which has no strict limit on the size
of RNA that can be encapsidated. Alternatively, TMV movement protein can be used in place
of BMV 3a (Figure 7c). Hybrids between tobamoviruses and bromoviruses were shown to be viable
(Sacher et al., 1988; De Jong and Ahlquist, 1992).

Other permutations and combinations of genes pretransformed and those included in the
DNA-launching platform will readily be appreciated by the skilled artisan in light of the
teachings herein. (See, e.g., Figures 8c, 10b, and 10c).

As indicated above, the CCMV subgenomic promoter can be substituted for BMV
sequences in a desired DNA-launching platform. Because the sequence of the CCMV subgenomic
promoter differs from the sequence of the BMV subgenomic promoter, the probability of
recombination that would result in loss of a foreign gene is much lower in a construct having a
combination of these two different promoters.

In the above examples, trans-acting components may include, but are not limited to, 
replication factors, components responsible for cell to cell movement, or components such as 
the coat protein which may be required for long distance spread, viral proteases responsible for 
post translational processing, or other known trans-acting functions. 

Example 10 - Transfection of N. tabacum Protoplasts with GUS Containing DNA-Launching
Platforms

N. tabacum protoplasts isolated using the above described methods were inoculated by
electroporation with DNA-launching platforms for BMV RNA3 derivatives in the presence or
absence of 1a and 2a expression plasmids. BMV RNA3 derivatives contained the GUS gene in
place of the coat protein ORF (Figure 5a) (these were inoculated with or without coat protein
expression plasmid, Figure 5b), or had the BMV CP gene translated from an additional
subgenomic RNA driven from BMV or CCMV subgenomic promoter (Figures 5c and 5d), or
had the SHMV coat protein translated from an additional BMV subgenomic RNA (Figure 7b).
Protoplasts were collected by centrifugation (800 rpm, 5 min.) 24 hours post inoculation. The
chemiluminescent GUS assay was performed using GUS-Light™ (Tropix, MA) according to
manufacturer's instructions. Protein concentrations were determined using the Bio-Rad protein
kit (Bio-Rad Laboratories, Hercules, CA). The GUS values, determined by luminometer, were
adjusted to the same total protein concentration. Figures 16a and 16b show successful GUS
expression in protoplasts in the presence of trans-acting BMV replication factors 1a and 2a.
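Adjusting luminometer readings to the same total protein amount is a proportional rescaling. The following is a minimal sketch of one way this could be done; the function name, the 10 µg reference amount, and all readings are hypothetical illustrations and are not data from Figures 16a and 16b.

```python
def normalize_gus(light_units, protein_ug, reference_protein_ug):
    """Rescale a raw luminometer reading to a common amount of total protein."""
    if protein_ug <= 0:
        raise ValueError("protein amount must be positive")
    return light_units * reference_protein_ug / protein_ug


# Hypothetical readings: (relative light units, ug total protein assayed).
readings = {
    "RNA3-GUS + 1a + 2a": (120000.0, 12.0),
    "RNA3-GUS alone": (900.0, 9.0),
}

REFERENCE_UG = 10.0  # hypothetical common protein amount
normalized = {
    name: normalize_gus(rlu, protein, REFERENCE_UG)
    for name, (rlu, protein) in readings.items()
}
for name, value in normalized.items():
    print(f"{name}: {value:.0f} light units per {REFERENCE_UG:.0f} ug protein")
```

Normalizing before comparison keeps differences in protoplast recovery between samples from being mistaken for differences in GUS expression.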

Example 11 - Transfection of N. tabacum Protoplasts with GFP Containing DNA-Launching
Platform

N. tabacum protoplasts isolated by using the above described methods were transfected
by electroporation with expression plasmids for trans-acting BMV replication factors 1a and 2a
and with DNA-launching platforms for RNA3 derivatives having the GFP gene in place of BMV
coat protein ORF (Figure 6e), the CP gene translated from an additional subgenomic RNA
(Figure 6a) or with an RNA transcript having the GFP expressed as a fusion protein with BMV
3a ORF (Figure 6d). Protoplasts were incubated for 24 hrs and examined for GFP expression
using a fluorescent microscope. Figure 18 shows the successful expression of GFP in
protoplasts.

Example 12 - Transfection of (1a + 2a)-Transgenic Plants with BMV RNA3-Based DNA-
Launching Platform Containing GFP

N. benthamiana plants were transfected using particle bombardment as described
above with a DNA-launching platform for BMV RNA3 having the GFP gene in place of BMV
coat protein (Figure 6e). The GFP expression was determined 24 hrs post inoculation using a
fluorescent microscope. Figure 17 shows the successful expression of GFP in (1a + 2a)-
transgenic N. benthamiana.

Example 13 - Transfection of (1a + 2a)-Transgenic N. benthamiana with BMV RNA3 DNA-
Launching Platform Using Agrobacterium

N. benthamiana plants were inoculated with BMV RNA3 DNA-launching platform
using Agrobacterium tumefaciens. Once the desired construct (pB3LR42) was obtained in E.
coli it was transferred to A. tumefaciens strain LBA4404 using a thaw-freeze method as
described above. The Agrobacterium was grown overnight at 28 °C under constant shaking. A
single lower leaf of N. benthamiana was punctured with a needle multiple times and submerged
in Agrobacterium culture. The plants were grown at 23 °C with a 16 hr photoperiod. The
inoculated leaves were harvested 14 days post-inoculation. The total RNA extraction and
northern blot hybridization were performed as described above. Figure 19 shows replication of
launched BMV RNA3 in inoculated (1a + 2a)-transgenic N. benthamiana.

Example 14 - Transfection of (1a + 2a)-Transgenic Plants with BMV RNA3-Based DNA-
Launching Platform Containing GUS and SHMV Coat Protein

N. benthamiana plants were transfected using particle bombardment as described
above with a DNA-launching platform for BMV RNA3 wherein the BMV coat protein was
replaced with the sunn-hemp mosaic virus (SHMV) coat protein and the GUS gene was inserted
downstream of an additional BMV subgenomic promoter (Figure 7b). The GUS expression was
determined by the histochemical GUS assay described above. Figure 20 shows the successful
expression of GUS in (1a + 2a)-transgenic plants.

Example 15 - Movement of Launched BMV RNA3

F1 progeny plants from self-fertilized (1a+2a)-transgenic N. benthamiana BP14 were
inoculated with BMV RNA3 DNA-launching platform using Agrobacterium tumefaciens.
Seedlings were germinated on Smurf media containing kanamycin. Plants were grown at 23 °C
with a 16 hr photoperiod. Once the desired construct (pB3LR42) was obtained in E. coli it was
transferred to A. tumefaciens strain LBA4404 using a thaw-freeze method as described above.
The Agrobacterium was grown overnight at 28 °C under constant shaking. A single lower leaf
of N. benthamiana was punctured with a needle multiple times and submerged in Agrobacterium
culture. The inoculated, middle, and upper leaves were harvested 14 days post-inoculation.
Total RNA extraction and northern blot hybridization were performed as described above.
RNA3 replication was detected in all leaves tested (Fig. 21). This shows that BMV RNA3 is able
to replicate, move cell-to-cell and spread long distance in (1a+2a)-transgenic plants.

Example 16 - Transfection of Progeny From (1a+2a)-Transgenic N. benthamiana With BMV
RNA3 DNA-Launching Platform

Progeny plants from self-fertilized (1a+2a)-transgenic N. benthamiana (designated
BP14) were inoculated with BMV RNA3 DNA-launching platform using Agrobacterium as
described in Example 13. Control plants (non-transgenic N. benthamiana) were inoculated with
the sap from BMV infected barley using inoculation buffer composed of 50 mM NaPO4, pH 7.0,
and 1% celite. Root samples were harvested 6 weeks post inoculation. RNA extraction and
northern blot hybridization were performed as described above. Figure 22 shows that BMV
RNA3 replicated to very high levels in roots. In some (1a+2a)-transgenic plants (Figure 22,
lanes 2, 5, 6, 7, 8, 10) replication of launched RNA3 dramatically exceeded replication of wild-
type BMV in non-transgenic N. benthamiana plants (Figure 22, lane 1). This shows that this
system can be used for delivery of RNA, proteins, peptides or other compounds to roots and
enables testing of such compounds for various activities, for example, activities directed against
root parasites. For example, proteins with anti-nematode activities can be inserted into the RNA3
DNA-launching platform using the above described strategies and expressed in roots upon
RNA3 replication. Such proteins can be engineered to be expressed in the cytoplasm or
alternatively secreted into the surrounding soil.



WO 99/61597 PCT/US99/11250


Example 17 - Barley Stripe Mosaic Virus

Barley stripe mosaic virus (BSMV) has a tripartite genome (RNA alpha, beta, and
gamma). These genomic RNAs have an m7Gppp cap at the 5' end and a tRNA-like structure
at the 3' end (Jackson and Hunter, 1989).

A DNA-launching plasmid is constructed for BSMV RNA alpha, RNA beta, and RNA gamma
containing BSMV RNA cDNA precisely fused at its 5' end to a DNA-
dependent RNA polymerase promoter and at its 3' end to a self-cleaving ribozyme. A
polyadenylation signal may also be included. Alternatively, a convenient restriction site may
be engineered at the 3' end of viral cDNAs. Foreign genes or sequences may be expressed in
several ways. For example, DNA-launching plasmids based on BSMV RNA beta may contain
a foreign gene or sequence expressed in place of ORF beta a.

Transgenic plants having one or more trans-acting factors fused to the DNA-dependent
RNA polymerase promoter and terminator are obtained. Such trans-acting factors may include
parts of the viral RNA replicase (ORFs alpha a and/or gamma a) or other trans-acting factors.
The trans-acting factors are stably expressed in the plant cell or their expression may be induced
if an inducible promoter is used. Cis-acting sequences necessary for BSMV RNA replication
are removed from transgenes. Alternatively, the full-length RNA alpha is expressed from the
chromosome. Alternatively, ORF gamma a including the 5' untranslated region and ORF
gamma b from a seed transmitted strain, such as ND18, are also expressed (Edwards, 1995).

A DNA-launching plasmid is constructed containing the DNA-dependent RNA
polymerase promoter precisely fused to the 5' end of the BSMV RNA beta, cis-acting elements
important for the BSMV RNA beta life cycle, such as the 5' and 3' ends and the intercistronic region
between the beta a and beta b ORFs (Zhou and Jackson, 1996), and a foreign gene or sequence
in place of ORF beta a (coat protein), which is dispensable for BSMV replication and movement
(Petty and Jackson, 1990). Such DNA-launching plasmids may lack the internal poly(A) region,
as this region is dispensable for replication, and contain a ribozyme or a convenient restriction
site at the 3' end of the modified viral RNA. Alternatively, a DNA-launching plasmid is
constructed from RNA gamma in which ORFs gamma a and/or gamma b are replaced with
foreign genes or sequences which may also include the triple gene block genes (ORFs beta b,
beta c, and beta d) or a heterologous movement protein (TMV 30K, RCNMV 35K).



Example 18 - Tobacco Mosaic Virus

Tobacco mosaic virus (TMV) has a single-stranded positive sense RNA genome. The
5' end has an m7Gppp cap and the 3' end contains a tRNA-like structure.

A DNA-launching plasmid is constructed based on TMV RNA containing TMV cDNA
precisely fused at its 5' end to a DNA-dependent RNA polymerase promoter and at its 3' end
to a self-cleaving ribozyme. A polyadenylation signal may also be included. Alternatively, a
convenient restriction site may be engineered at the 3' end. A foreign gene may be expressed from
an additional subgenomic RNA by including an additional subgenomic RNA promoter on the
(-) strand.

Transgenic plants are obtained having one or more trans-acting factors fused to the
DNA-dependent RNA polymerase promoter and terminator. Such factors may include the viral
replicase (126K/183K), movement protein (30K), or coat protein (17.6K). At least one cis-acting
sequence necessary for TMV RNA replication is removed from transgenes. The trans-
acting factors are stably expressed in the plant cell or their expression may be induced if an
inducible promoter is used.

A DNA-launching plasmid is constructed containing the DNA-dependent RNA
polymerase promoter precisely fused to the 5' end of the TMV cDNA, cis-acting elements
important for the TMV life cycle, such as the 5' and 3' ends, origin of assembly, etc., at least one
foreign gene or sequence in place of the trans-acting factor that is expressed from the
chromosome, and a ribozyme or a convenient restriction site at the 3' end. Alternatively, the
foreign gene sequence can be expressed from an additional subgenomic RNA promoter and the
sequence coding for the trans-acting factor that is expressed from the transgene can be deleted
from the DNA-launching plasmid. Preferably, if the viral replicase proteins are expressed in
transgenic plants, the DNA-launching plasmid will have a deletion of nucleotides 3420-4902,
which appears to be a region that inhibits replication in trans (Lewandowski et al., 1998).
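The deletion of nucleotides 3420-4902 can be expressed as a coordinate operation on the cDNA sequence. The sketch below assumes the coordinates are 1-based and inclusive, which the text does not state explicitly; the function name is hypothetical, and the placeholder string of 6395 "N" characters (an assumed genome length) stands in for a real TMV cDNA sequence, which is not reproduced here.

```python
def delete_region(seq, start, end):
    """Remove nucleotides start..end (assumed 1-based, inclusive) from seq."""
    if not (1 <= start <= end <= len(seq)):
        raise ValueError("coordinates fall outside the sequence")
    return seq[:start - 1] + seq[end:]


DEL_START, DEL_END = 3420, 4902          # coordinates given in the text
deleted_len = DEL_END - DEL_START + 1    # number of nucleotides removed

# Placeholder sequence; the length 6395 is an assumption for illustration only.
placeholder_cdna = "N" * 6395
shortened = delete_region(placeholder_cdna, DEL_START, DEL_END)

print(f"deleted {deleted_len} nt, {len(shortened)} nt remain")
```

Under these assumptions the deletion removes 1483 nucleotides. With a real sequence, the same call would also show whether the flanking reading frame is preserved across the junction.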



Example 19 - Potato Virus X

Potato virus X (PVX) has a single-stranded positive sense RNA genome. The 5' end has
an m7Gppp cap and the 3' end is polyadenylated. A full-length cDNA clone of PVX has been
constructed and infectious RNA transcripts obtained (Hemenway et al., 1990).

A DNA-launching plasmid is constructed based on PVX RNA containing PVX cDNA
precisely fused at its 5' end to a DNA-dependent RNA polymerase promoter and having a
polyadenylation site at its 3' end. A convenient restriction site may also be included at the 3'
end. A foreign gene may be expressed from an additional subgenomic RNA.

Transgenic plants are obtained having one or more trans-acting factors fused to the
DNA-dependent RNA polymerase promoter and terminator. Such factors may include the viral
RNA polymerase gene (ORF1-147K), coat protein (ORF5-21K), or triple gene block (ORF2-25K,
ORF3-12K, ORF4-8K). The triple gene block genes can be expressed individually.
Alternatively, they can be expressed as negative sense transcripts from which plus sense
subgenomic RNA for ORFs 2, 3, and 4 can be transcribed by the viral replicase. Such a transgene
will have a DNA-dependent RNA polymerase promoter fused to the sequence of ORFs 2, 3, and 4
in the minus sense orientation and the transcribed sequence will include a subgenomic RNA
promoter. At least one cis-acting sequence necessary for PVX RNA replication is removed from
transgenes. The trans-acting factors are stably expressed in the plant cell or their expression may
be induced if an inducible promoter is used.

A DNA-launching plasmid is constructed containing the DNA-dependent RNA
polymerase promoter precisely fused to the 5' end of the PVX genome, cis-acting elements
important for the PVX life cycle, such as the 5' and 3' ends, origin of assembly, etc., at least one
foreign gene or sequence in place of the trans-acting factor that is expressed from the
chromosome and a polyadenylation signal. Alternatively, the foreign gene sequence can be
expressed from an additional subgenomic RNA promoter and the sequence coding for the trans-acting
factor that is expressed transgenically can be deleted from the DNA-launching plasmid.

Alternatively, a DNA-launching plasmid is constructed having a DNA-dependent RNA
polymerase promoter, polyadenylation site, and the PVX cDNA sequence in which the ORF2
(25K) is replaced with a foreign gene or sequence. Alternatively, the ORF2 is deleted and the
foreign gene is expressed from an additional subgenomic RNA promoter. Such a DNA-launching
plasmid is inoculated to transgenic plants expressing movement protein from a
heterologous virus, such as tobacco mosaic virus (TMV 30K), tomato mosaic virus (ToMV
30K), or red clover necrotic mosaic virus (RCNMV 35K).

Example 20 - Flock House Virus 

Flock house virus (FHV) has a genome consisting of two single stranded RNAs. RNA1
encodes protein A, involved in RNA replication, and protein B that is translated from sg RNA3
and is dispensable for RNA replication. RNA2 encodes virion capsid precursor protein alpha.
FHV is infectious to insect, plant, mammalian, and yeast cells (Selling et al., 1990; Price et al.,
1996).

A DNA-launching plasmid is constructed for FHV RNA1 and RNA2 containing FHV
RNA cDNA precisely fused at its 5' end to a DNA-dependent RNA polymerase promoter and
at its 3' end to a self-cleaving ribozyme. A polyadenylation signal may also be included.
Alternatively, a convenient restriction site may be engineered at the 3' end. Foreign genes or
sequences may be expressed in several ways. For example, DNA-launching plasmids based on
FHV RNA1 may contain a foreign gene or sequence expressed from subgenomic RNA3 as ORF
B replacement or as a translational fusion with ORF B. Alternatively, a foreign gene may be
expressed from an additional sg RNA. DNA-launching plasmids based on FHV RNA2 may
contain a foreign gene(s) or sequence(s) expressed as a part of polyprotein alpha. Foreign
gene(s) in such a construct may include sequences necessary for polyprotein cleavage. DNA-
launching plasmids will preferably also express a movement protein of a heterologous plant
virus, such as 30K of TMV or 35K of RCNMV. Alternatively, DNA-launching plasmids will
be inoculated onto transgenic plants expressing such movement protein.

Transgenic plants are obtained having one or more trans-acting factors fused to the
DNA-dependent RNA polymerase promoter and terminator. Such factors may include protein
A or capsid protein precursor alpha, and preferably will also include a movement protein from
a plant virus, such as 30K of TMV or 35K of RCNMV. Trans-acting factors are stably
expressed in the plant cell or their expression may be induced if an inducible promoter is used.
Transgenically expressed trans-acting factors preferably lack at least one cis-acting factor which
is necessary for their replication, such as the 5' and/or 3' end.

A DNA-launching plasmid is constructed based on FHV RNA1 or FHV RNA2
containing a DNA-dependent RNA polymerase promoter precisely fused to the 5' end of RNA1
(or RNA2), cis-acting elements important for FHV RNA1 (or RNA2) replication, such as the 5'
and 3' ends, at least one foreign gene or sequence and a self-cleaving ribozyme at the 3' end.
A polyadenylation signal may also be included. Alternatively, a convenient restriction site may
be engineered at the 3' end of the modified viral RNA sequence of the DNA-launching plasmid.
DNA-launching plasmids based on FHV RNA1 may contain a foreign gene or sequence in place
of ORF A. Alternatively, the ORF A may be deleted and the foreign gene may be expressed
from subgenomic RNA3, for example as an ORF B replacement or as a translational fusion with
ORF B. Alternatively, a DNA-launching plasmid may contain two exogenous RNA sequences,
one in the place of ORF A and the other expressed from the subgenomic RNA3. DNA-
launching plasmids based on FHV RNA2 may contain a foreign gene(s) or sequence(s) in place
of ORF alpha or expressed as a part of polyprotein alpha. Foreign gene(s) in such a construct
may include sequences necessary for polyprotein cleavage.

Example 21 - Tomato Spotted Wilt Virus

Tomato spotted wilt virus (TSWV) is a tripartite (RNA L, M, S), negative sense and
ambisense, single stranded RNA virus.

Transgenic plants are obtained having one or more trans-acting factors fused to the
DNA-dependent RNA polymerase promoter and terminator. Such factors include the putative
TSWV polymerase gene (ORF L), ORF N, and possibly other trans-acting factors (NSm or NSs).
At least one cis-acting sequence necessary for TSWV RNA replication, such as the 5' and/or
3' ends, is removed from the transgene. Trans-acting factors are stably expressed in the
plant cell or their expression may be induced if an inducible promoter is used.

A DNA-launching plasmid is constructed based on TSWV RNA M in which the G1 and
G2 coding sequences are replaced with at least one foreign gene or sequence. Such a DNA-launching
plasmid contains a DNA-dependent RNA polymerase promoter and TSWV RNA M
cDNA fused to the self-cleaving ribozymes at the 5' and 3' ends. Alternatively, a DNA-launching
plasmid is constructed based on TSWV RNA S in which the N coding region is
replaced with a foreign gene or sequence.

Example 22 - Barley Mild Mosaic Virus 

The genome of barley mild mosaic virus (BaMMV) consists of two positive sense, single-stranded,
3'-polyadenylated RNAs. The RNA1 encodes proteins related to the potyviral P3,
6K1, CI, 6K2, NIa-VPg, NIa-Pro, NIb and capsid protein (Kashiwazaki et al., 1990). The RNA2
encodes P1 and P2 proteins (Kashiwazaki et al., 1991). The P1 protein is related to the potyviral
HC-Pro and the P2 protein is important for fungal transmission. An isolate was obtained
containing a deletion in the P2 protein (Timpe and Kuhne, 1995), thus indicating that P2 is
dispensable for viral RNA replication.

A DNA-launching plasmid is constructed for BaMMV RNA1 and RNA2 containing
BaMMV RNA cDNA precisely fused at its 5' end to a DNA-dependent RNA polymerase
promoter and having a polyadenylation site at its 3' end. Foreign genes or sequences may be expressed
in several ways. For example, DNA-launching plasmids based on BaMMV RNA2 may contain
a foreign gene or sequence expressed as a part of a polyprotein which can be cleaved so that a foreign
protein can be released.

Transgenic plants are obtained having the BaMMV RNA1 cDNA lacking the 5' and 3'
ends fused to the DNA-dependent RNA polymerase promoter and terminator.

A DNA-launching plasmid is constructed based on BaMMV (isolate M) RNA2. Such a
plasmid contains a DNA-dependent RNA polymerase promoter precisely fused to the 5' end of
RNA2, RNA2 cis-acting replication signals located in the 5' and 3' ends, P1 ORF and a foreign
gene in place of P2 ORF or expressed as a part of the P1/P2 polyprotein which can be cleaved so that
a foreign protein can be released.

The contents of all references cited throughout are incorporated herein by this reference
to the extent they are not inconsistent with the disclosure, teachings, and principles of the subject
invention.

It should be understood that the examples and embodiments described herein are for
illustrative purposes only and that various modifications or changes in light thereof will be
suggested to persons skilled in the art and are to be included within the spirit and purview of this
application.

Claims

1. A DNA-launching platform comprising:
a) a polynucleotide molecule encoding a modified viral RNA molecule; and
b) a DNA dependent RNA polymerase promoter.

2. The DNA-launching platform of claim 1 further comprising a sequence encoding at
least one cis-acting element.

3. The DNA-launching platform of claim 1 further comprising a ribozyme sequence.

4. The DNA-launching platform of claim 1 further comprising a termination sequence.

5. The DNA-launching platform of claim 1 further comprising a restriction site.

6. The DNA-launching platform of claim 1 wherein said modified RNA molecule
comprises an exogenous RNA segment.

7. The DNA-launching platform of claim 1 wherein said DNA dependent RNA
polymerase promoter is capable of functioning in a plant cell.

8. A method of genotypically or phenotypically modifying one or more cells comprising
the following steps:
a) obtaining a DNA-launching platform comprising a polynucleotide molecule encoding
a modified viral RNA; and
b) transfecting said one or more cells with said DNA-launching platform, wherein said
polynucleotide molecule is transcribed thereby forming a replicatable RNA transcript.

9. The method of claim 8 further comprising pre-transforming said cell with at least one
polynucleotide molecule encoding at least one trans-acting factor.
10. The method of claim 8 further comprising introducing a trans-acting factor.

11. The method of claim 10 wherein said introducing a trans-acting factor comprises
co-transfection of an expression plasmid comprising a nucleotide sequence encoding said trans-
acting factor.

12. The method of claim 10 wherein said introducing a trans-acting factor comprises
co-transfection of an RNA transcript encoding said trans-acting factor.

13. The method of claim 10 wherein said trans-acting factor is stably expressed.

14. The method of claim 8 wherein said modified viral RNA comprises an exogenous
RNA segment.

15. The method of claim 8 wherein said DNA-launching platform comprises a ribozyme
sequence.

16. The method of claim 8 wherein said DNA-launching platform comprises a
promoter.

17. The method of claim 8 wherein said DNA-launching platform comprises a
termination sequence.

18. The method of claim 8 wherein said DNA-launching platform comprises a
restriction site.

19. The modified cell produced by the method of claim 8.

20. A method of producing a plant or plant tissue comprising at least one genotypically
or phenotypically modified cell, said method comprising transfecting cells of said plant or plant
tissue with a DNA-launching platform, wherein said DNA-launching platform comprises a
polynucleotide encoding a modified RNA molecule, such that said polynucleotide molecule is
transcribed to form a replicatable RNA transcript.

21. The method of claim 20 wherein said modified RNA molecule comprises an
exogenous RNA segment.

22. The method of claim 20 wherein said DNA-launching platform comprises a
ribozyme sequence.

23. The method of claim 20 wherein said DNA-launching platform comprises a
promoter.

24. The method of claim 20 wherein said DNA-launching platform comprises a
termination sequence.

25. The method of claim 20 wherein said DNA-launching platform comprises a
restriction site.

26. A method of producing a genotypically or phenotypically modified plant comprising
obtaining at least one modified cell produced by the method of claim 8; and subjecting said
modified cell to conditions whereby a plant is regenerated therefrom.

27. A plant produced by the method of claim 26.

28. A plant descended from the plant of claim 27.

29. The method of claim 20, wherein said plant or plant tissue comprises one or more
cells transformed with a polynucleotide molecule encoding at least one trans-acting factor,
wherein said polynucleotide molecule is expressed.

30. The method of claim 29, wherein said modified viral RNA molecule is capable of replication only in said one or more cells transformed with a polynucleotide molecule encoding at least one trans-acting factor.

SEQUENCE LISTING

<110> WISCONSIN ALUMNI RESEARCH FOUNDATION

Street Address: 614 Walnut Street
City: Madison
State: Wisconsin
Country: US
ZIP: 53705
Phone number: (608) 265-2135 Fax: (608) 263-1064

<120> Improved Methods and Materials for Transformation

<130> WARF-100XC1

<140>
<141>

<150> 60/086,526
<151> 1998-05-22

<160> 8

<170> PatentIn Ver. 2.0

<210> 1 
<211> 7074 
<212> DNA 

<213> Brome mosaic virus 
<400> 1 

AAACACTGAT AGTTTAAACT GAAGGCGGGA AACGACAATC TGATCATGAG CGGAGAATTA 60
AGGGAGTCAC GTTATGACCC CCGCCGATGA CGCGGGACAA GCCGTTTTAC GTTTGGAACT 120
GACAGAACCG CAACGATTGA AGGAGCCACT CAGCCGCGGG TTTCTGGAGT TTAATGAGCT 180
AAGCACATAC GTCAGAAACC ATTATTGCGC GTTCAAAAGT CGCCTAAGGT CACTATCAGC 240
TAGCAAATAT TTCTTGTCAA AAATGCTCCA CTGACGTTCC ATAAATTCCC CTCGGTATCC 300
AATTAGNNNN NNNNNNNNNN NNNNNNNNNN GATCGTTTCG CATGATTGAA CAAGATGGAT 360
TGCACGCAGG TTCTCCGGCC GCTTGGGTGG AGAGGCTATT CGGCTATGAC TGGGCACAAC 420
AGACAATCGG CTGCTCTGAT GCCGCCGTGT TCCGGCTGTC AGCGCAGGGG CGCCCGGTTC 480
TTTTTGTCAA GACCGACCTG TCCGGTGCCC TGAATGAACT GCAGGACGAG GCAGCGCGGC 540
TATCGTGGCT GGCCACGACG GGCGTTCCTT GCGCAGCTGT GCTCGACGTT GTCACTGAAG 600
CGGGAAGGGA CTGGCTGCTA TTGGGCGAAG TGCCGGGGCA GGATCTCCTG TCATCTCACC 660
TTGCTCCTGC CGAGAAAGTA TCCATCATGG CTGATGCAAT GCGGCGGCTG CATACGCTTG 720
ATCCGGCTAC CTGCCCATTC GACCACCAAG CGAAACATCG CATCGAGCGA GCACGTACTC 780
GGATGGAAGC CGGTCTTGTC GATCAGGATG ATCTGGACGA AGAGCATCAG GGGCTCGCGC 840
CAGCCGAACT GTTCGCCAGG CTCAAGGCGC GCATGCCCGA CGGCGATGAT CTCGTCGTGA 900
CCCATGGCGA TGCCTGCTTG CCGAATATCA TGGTGGAAAA TGGCCGCTTT TCTGGATTCA 960
TCGACTGTGG CCGGCTGGGT GTGGCGGACC GCTATCAGGA CATAGCGTTG GCTACCCGTG 1020
ATATTGCTGA AGAGCTTGGC GGCGAATGGG CTGACCGCTT CCTCGTGCTT TACGGTATCG 1080
CCGCTCCCGA TTCGCAGCGC ATCGCCTTCT ATCGCCTTCT TGACGAGTTC TTCTGANNNN 1140
NNNNNNNNNN NNNNNNNNNN GATCGTTCAA ACATTTGGCA ATAAAGTTTC TTAAGATTGA 1200
ATCCTGTTGC CGGTCTTGCG ATGATTATCA TATAATTTCT GTTGAATTAC GTTAAGCATG 1260
TAATAATTAA CATGTAATGC ATGACGTTAT TTATGAGATG GGTTTTTATG ATTAGAGTCC 1320
CGCAATTATA CATTTAATAC GCGATAGAAA ACAAAATATA GCGCGCAAAC TAGGATAAAT 1380
TATCGCGCGC GGTGTCATCT ATGTTACTAG ATCGGGCCTC CTGTCAATGC TGGCGGCGGC 1440
TCTGGTGGTG GTTCTGGTGG CGGCTCTGAG GGTGGTGGCT CTGAGGGTGG CGGTTCTGAG 1500
GGTGGCGGCT CTGAGGGAGG CGGTTCCGGT GGTGGCTCTG GTTCCGGTGA TTTTGATTAT 1560
GAAAAGATGG CAAACGCTAA TAAGGGGGCT ATGACCGAAA ATGCCGATGA AAACGCGCTA 1620
CAGTCTGACG CTAAAGGCAA ACTTGATTCT GTCGCTACTG ATTACGGTGC TGCTATCGAT 1680
GGTTTCATTG GTGACGTTTC CGGCCTTGCT AATGGTAATG GTGCTACTGG TGATTTTGCT 1740
GGCTCTAATT CCCAAATGGC TCAAGTCGGT GACGGTGATA ATTCACCTTT AATGAATAAT 1800
TTCCGTCAAT ATTTACCTTC CCTCCCTCAA TCGGTTGAAT GTCGCCCTTT TGTCTTTGGC 1860
CCAATACGCA AACCGCCTCT CCCCGCGCGT TGGCCGATTC ATTAATGCAG CTGGCACGAC 1920
AGGTTTCCCG ACTGGAAAGC GGGCAGTGAG CGCAACGCAA TTAATGTGAG TTAGCTCACT 1980
CATTAGGCAC CCCAGGCTTT ACACTTTATG CTTCCGGCTC GTATGTTGTG TGGAATTGTG 2040
AGCGGATAAC AATTTCACAC AGGAAACAGC TATGACCATG ATTACGCCAA GCTTGCATGC 2100
CTGCAGGTCG ACTCTAGAGG ATCCCCGGTC ACTGGATTTT GGTTTTAGGA ATTAGAAATT 2160
TTATTGATAG AAGTATTTTA CAAATACAAA TACATACTAA GGGTTTCTTA TATGCTGAAC 2220
ACATGAGCGA AACCCTATAA GAACCCTAAT TCCCTTATCT GGGAACTACT CACACATTAT 2280
TCTGGAGAAA ATAGAGAGAG ATAGATTTGT AGAGAGAGAC TGGTGATTTG CGGACTCTAG 2340
AGGATCCCCG GGTACCGAGC TCGAATTCTC GAGCAGAGGT CTCACACAGA GACAAGCGCA 2400
TCACTTAACA CAATTAAAGA TCAAATCACC AGCGAGCTCG CCGTTAAAGC AATACTCAAA 2460
GGACTTCTTG TGTCGTGTTA AGGCAACCAA ACAGTACTCC TCATGTTTAA ACAAATCACA 2520
TTTGGTCGAC TTAAGCCGAA CCAAAGTGAC GTTGTCAACA GAGATCCCTT GCGCTTCGTG 2580
TACTGTTTTT ATGTGTCCAT CAATCCAGTC CTTGCTCACG GGAAAATCCT TAGCCCTCGT 2640
TTGAAGGGCC GCTTTATCAG CTTGAGTCAT CGTAAGATAC GTTCTGTTCG GATCAATAGT 2700
GACCTGCAAA CCAGAAGTAA TACGACGCTT CGTGAGACTT CTAGAAACTT TGGACTCAGA 2760
TGTCCAGGAT TGATACTTCG TGTCCCTATT ACCGCATTTA CGCTTCAGCA GATTAACAGC 2820
AGCGATAACA TCTTGCGGAC ACCGGTAAGT CTTGTGAACA ACGTCACGGC GATCATATTG 2880
CAGATTACCG TGGAGCAATT TAAAACCCGC GTCACGAGAC TTGAACGAAA TCTGCTCTGT 2940
GTCCCCAAAG GCAAGAACTT GTGAACATTT AGACAGAGCA GCCACCACCA GGAGTTGACC 3000
ATAATGTAGT AAACCAGCCT CATCAACAAG CAGCCTATGA CAGGACGGTA CACCGTGCAT 3060
GATCGCAGAA TCCGCGGTGC GCACAACGTC CAAAGCTACC TTGGAATTAT AAGTGTCAGG 3120
GAATAAAGCC ATCCTGACGT CCTCGGCCGA TTTACGATTC GCCGTCACAA TTAGGTCCTC 3180
TCCCATACGG AATGCATCTT TTATGGCAGT GGTTTTACCG CATCCCGCAA CTCCATCAAC 3240
CATGGAAATA TCGCATGTAG GGACAGAAAC TTTGGCGCTA GCTTCTGCAA TGTCCCTCAA 3300
GTTAGAGCAT GCACATGTTT TATCAACAAT GTACGTTTCA TCTGCGTGCT TCGGACCTAA 3360
ACCATGCTCA TTATATCCAA CAGTGTAATC GTATTTTTTA GGATACAACC AGTTACCGTT 3420
GGCCAAATGG ACATTCACCA TATCGTCTAT GCGATGGTAG GTCTCAAAGA TGCTCTTATT 3480
TGCGATCTCA CTTCCGCGAC CGCCGGAAAT GTCCCATAGG TGACGAAGAT TAGACTCGGA 3540
GTTGTTATGT AATCTCTTAC AATAACGCAC AAATTCCTTC ATGGCTCCGT GTCTAGATAT 3600
GCCACGAGGG TCCGTTGGTA CCTCAACAGA CACCTCGGCA TCCGGGACCA CATCAGTCAC 3660
CGGTTTAACG TCATCACTGA CGGACTCAGG GCTCGAACTC TCAGGGGCAT CATGAAACTC 3720
CTCCTGAGGT ATCTCAGCAG CTGGCGGGAC TTTCGCCTTC TTCTTCGAGC GCTTGGTCTT 3780
GGCTGTCTGC ACTTCATGCT CCAGCCGGTC GAATAAGTCC TCTTCAGTCC AAAACGTTCT 3840
CAAACGTGAT ATCGGTACAG AATCTTGCTC AAATTCTTCA ACGTTTGAGA GACGAGTCAG 3900
AAACTTAAAA CTGTCCGCAT AAGAATCCAG ACGTAGTAGG GGAAATCTGC TAGCCAATGT 3960
TCTCAGCCAT CCTACTTTCG CCCTGGATGA ATCTCCACCC CACCAAAACC TAGTTTTGAA 4020
GTGATGGCAC CAACCTTTCC ATTCCATCCC ATCGCGGAGG GCCGTAAGCT TTTCGTACTT 4080
TTGATACAGA TTCAAAGTCA AAGCAAAGGC CACTAGATGA TAATCTTCAA TGTCTAAGCG 4140
CTCACCAGCC ATGATAGCCT GACCGTTAAT AATAACAGTC GACGACTTGG CGGATAAGAT 4200
AGATGCGACA GCTTTCATGT TCTCAGTCCA TTCTTTACTT TCCTTGAAAC ATCTGAAAGC 4260
TATCTCCTCT ACCTCTCTCA CTGTGGTTTT GGCGACGCGC ACACATTTCC AGCGATTGAG 4320
ACTCCAGTCT TCAGGTATTG AGACCCCTAC GTACTTAGAT ATGTCTTCAA ACCATACACA 4380
GTGACGTAGT GTCTCCCGGG GGCAGCGTAA ATTTGTAGCG ATGATCTTAT AGGTCATGAT 4440
GTTACATTTC AGCATTTCGC GCTCCAACAG ATAGGTGGTT CCATCGATGC AATGCACCGA 4500
CTCGGTGAAA AATGAGCCCA AATCTTGCCA TCCGTGGATG TAAGATAATG TGCTTTCATT 4560
TTCAAAATCG AATTTGATCA CCTCATCCGC GCCTGACCCG TCACGTTGCC AGTGACATTT 4620
AAGCAAGGGA AGAAAACCCT CGCGGTCAAA CAACATGGCG CCGTCGAACA TAACGGTACC 4680
ACGTAGTACG CGTACTCCAT GCGAATGCAT GGCGTCACAC AGACCTTGGA AGCCCATATC 4740
ATAACCGCCG TGGATACAGA TAGCCCAATC AGCTTGGACA TCACAATCTT GAGCTCGGTT 4800
AAGACAAAAG TTCGGGACTT CATCGAAATC ATCGCTTTCT TGCAAAATTT TTCGCATGCG 4860
GCACATCCTC TCCTCATGTC GGGCAGCGTC TCTAACACCC AACACAGGAC AACAACTGTG 4920
CACCCTTTTA TCCCTTCTTG AAAAGTGATG CCACCAAGAC CCTCCGAAAT CTATAACGGG 4980
GTCTTCAGGG GGAAAACTGT CGAGACAGTC ATAATGCTCC GCTACACGCA GAGCACCAGC 5040
CAGGCTATGG GGCGCATGAT ACTGCTGAGT CAAATTTAAG TCAAAGGCAC CACCATAACG 5100
GTCACGGAAG GCGTCAGCCT CCTCAATAGA GAGCTTATTG CGAACGTTGA TTTTCTTAGA 5160
CCTTTTCGCG TATTCAATCT GCGCAGATAA CTGTTGCGCA ACCTGATTGT CTACGATGTC 5220
TTGGGCACTC TGGCTGTCAG CACCCTTCTC AGCAATCAAC TTCAGCAAAT CGATAGAACT 5280
TGACATTTTG TTGGTGAAAA ACAAAGAACA AGTAGCAGAA CCGTGGTCGA GGTCCTCTCC 5340
AAATGAAATG AACTTCCTTA TATAGAGGAA GGGTCTTGCG AAGGATAGTG GGATTGTGCG 5400
TCATCCCTTA CGTCAGTGGA GATATCACAT CAATCCACTT GCTTTGAAGA CGTGGTTGGA 5460
ACGTCTTCTT TTTCCACGAT GTTCCTCGTG GGTGGGGGTC CATCTTTGGG ACCACTGTCG 5520
GTAGAGGCAT TCTTGAACGA TAGCCTTTCC TTTATCGCAA TGATGGCATT TGTAGAAGCC 5580
ATCTTCCTTT TCTACTGTCC TTTCGATGAA GTGACAGATA GCTGGGCAAT GGAATCCGAG 5640
GAGGTTTCCC GATATTACCC TTTGTTGAAA AGTCTCAATA GCCCTCTGGT CTTCTGAGAC 5700
TGTATCTTTG ATATTCTTGG AGTAGACGAG AGTGTCGTGC TCCACCATGT TGACCGGGTG 5760
GTCAGTCCCT TATGTTACGT CCTGTAGAAA CCCCAACCCG TGAAATCAAA AAACTCGACG 5820
GCCTGTGGGC ATTCAGTCTG GATCGCGAAA ACTGTGGAAT TGATCAGCGT TGGTGGGAAA 5880
GCGCGTTACA AGAAAGCCGG GCAATTGCTG TGCCAGGCAG TTTTAACGAT CAGTTCGCCG 5940
ATGCAGATAT TCGTAATTAT GCGGGCAACG TCTGGTATCA GCGCGAAGTC TTTATACCGA 6000
AAGGTTGGGC AGGCCAGCGT ATCGTGCTGC GTTTCGATGC GGTCACTCAT TACGGCAAAG 6060
TGTGGGTCAA TAATCAGGAA GTGATGGAGC ATCAGGGCGG CTATACGCCA TTTGAAGCCG 6120
ATGTCACGCC GTATGTTATT GCCGGGAAAA GTGTACAATT CACTGGCCGT CGTTTTACAA 6180
CGTCGTGACT GGGAAAACCC TGGCGTTACC CAACTTAATC GCCTTGCAGC ACATCCCCCT 6240
TTCGCCAGCT GGCGTAATAG CGAAGAGGCC CGCACCGATC GCCCTTCCCA ACAGTTGCGC 6300
AGCCTGAATG GCGAATGNNN NNNNAATTCA GTACATTAAA AACGTCCGCA ATGTGTTATT 6360
AAGTTGTCTA AGCGTCAATT TGTTTACACC ACAATATATC CTGCCACCAG CCAGCCAACA 6420
GCTCCCCGAC CGGCAGCTCG GCACAAAATC ACCACTCGAT ACAGGCAGCC CATCAGNNNN 6480
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 6540
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 6600
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 6660
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 6720
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 6780
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 6840
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 6900
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 6960
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 7020
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNN 7074



<210> 2
<211> 6750
<212> DNA
<213> Brome mosaic virus
<400> 2

AAACACTGAT AGTTTAAACT GAAGGCGGGA AACGACAATC TGATCATGAG CGGAGAATTA 60
AGGGAGTCAC GTTATGACCC CCGCCGATGA CGCGGGACAA GCCGTTTTAC GTTTGGAACT 120
GACAGAACCG CAACGATTGA AGGAGCCACT CAGCCGCGGG TTTCTGGAGT TTAATGAGCT 180
AAGCACATAC GTCAGAAACC ATTATTGCGC GTTCAAAAGT CGCCTAAGGT CACTATCAGC 240
TAGCAAATAT TTCTTGTCAA AAATGCTCCA CTGACGTTCC ATAAATTCCC CTCGGTATCC 300
AATTAGNNNN NNNNNNNNNN NNNNNNNNNN GATCGTTTCG CATGATTGAA CAAGATGGAT 360
TGCACGCAGG TTCTCCGGCC GCTTGGGTGG AGAGGCTATT CGGCTATGAC TGGGCACAAC 420
AGACAATCGG CTGCTCTGAT GCCGCCGTGT TCCGGCTGTC AGCGCAGGGG CGCCCGGTTC 480
TTTTTGTCAA GACCGACCTG TCCGGTGCCC TGAATGAACT GCAGGACGAG GCAGCGCGGC 540
TATCGTGGCT GGCCACGACG GGCGTTCCTT GCGCAGCTGT GCTCGACGTT GTCACTGAAG 600
CGGGAAGGGA CTGGCTGCTA TTGGGCGAAG TGCCGGGGCA GGATCTCCTG TCATCTCACC 660
TTGCTCCTGC CGAGAAAGTA TCCATCATGG CTGATGCAAT GCGGCGGCTG CATACGCTTG 720
ATCCGGCTAC CTGCCCATTC GACCACCAAG CGAAACATCG CATCGAGCGA GCACGTACTC 780
GGATGGAAGC CGGTCTTGTC GATCAGGATG ATCTGGACGA AGAGCATCAG GGGCTCGCGC 840
CAGCCGAACT GTTCGCCAGG CTCAAGGCGC GCATGCCCGA CGGCGATGAT CTCGTCGTGA 900
CCCATGGCGA TGCCTGCTTG CCGAATATCA TGGTGGAAAA TGGCCGCTTT TCTGGATTCA 960
TCGACTGTGG CCGGCTGGGT GTGGCGGACC GCTATCAGGA CATAGCGTTG GCTACCCGTG 1020
ATATTGCTGA AGAGCTTGGC GGCGAATGGG CTGACCGCTT CCTCGTGCTT TACGGTATCG 1080
CCGCTCCCGA TTCGCAGCGC ATCGCCTTCT ATCGCCTTCT TGACGAGTTC TTCTGANNNN 1140
NNNNNNNNNN NNNNNNNNNN GATCGTTCAA ACATTTGGCA ATAAAGTTTC TTAAGATTGA 1200
ATCCTGTTGC CGGTCTTGCG ATGATTATCA TATAATTTCT GTTGAATTAC GTTAAGCATG 1260
TAATAATTAA CATGTAATGC ATGACGTTAT TTATGAGATG GGTTTTTATG ATTAGAGTCC 1320
CGCAATTATA CATTTAATAC GCGATAGAAA ACAAAATATA GCGCGCAAAC TAGGATAAAT 1380
TATCGCGCGC GGTGTCATCT ATGTTACTAG ATCGGGCCTC CTGTCAATGC TGGCGGCGGC 1440
TCTGGTGGTG GTTCTGGTGG CGGCTCTGAG GGTGGTGGCT CTGAGGGTGG CGGTTCTGAG 1500
GGTGGCGGCT CTGAGGGAGG CGGTTCCGGT GGTGGCTCTG GTTCCGGTGA TTTTGATTAT 1560
GAAAAGATGG CAAACGCTAA TAAGGGGGCT ATGACCGAAA ATGCCGATGA AAACGCGCTA 1620
CAGTCTGACG CTAAAGGCAA ACTTGATTCT GTCGCTACTG ATTACGGTGC TGCTATCGAT 1680
GGTTTCATTG GTGACGTTTC CGGCCTTGCT AATGGTAATG GTGCTACTGG TGATTTTGCT 1740
GGCTCTAATT CCCAAATGGC TCAAGTCGGT GACGGTGATA ATTCACCTTT AATGAATAAT 1800
TTCCGTCAAT ATTTACCTTC CCTCCCTCAA TCGGTTGAAT GTCGCCCTTT TGTCTTTGGC 1860
CCAATACGCA AACCGCCTCT CCCCGCGCGT TGGCCGATTC ATTAATGCAG CTGGCACGAC 1920
AGGTTTCCCG ACTGGAAAGC GGGCAGTGAG CGCAACGCAA TTAATGTGAG TTAGCTCACT 1980
CATTAGGCAC CCCAGGCTTT ACACTTTATG CTTCCGGCTC GTATGTTGTG TGGAATTGTG 2040
AGCGGATAAC AATTTCACAC AGGAAACAGC TATGACCATG ATTACGCCAA GCTTGCATGC 2100
CTGCAGGTCG ACTCTAGAGG ATCCCCGGTC AACATGGTGG AGCACGACAC TCTCGTCTAC 2160
TCCAAGAATA TCAAAGATAC AGTCTCAGAA GACCAGAGGG CTATTGAGAC TTTTCAACAA 2220
AGGGTAATAT CGGGAAACCT CCTCGGATTC CATTGCCCAG CTATCTGTCA CTTCATCGAA 2280
AGGACAGTAG AAAAGGAAGA TGGCTTCTAC AAATGCCATC ATTGCGATAA AGGAAAGGCT 2340
ATCGTTCAAG AATGCCTCTA CCGACAGTGG TCCCAAAGAT GGACCCCCAC CCACGAGGAA 2400
CATCGTGGAA AAAGAAGACG TTCCAACCAC GTCTTCAAAG CAAGTGGATT GATGTGATAT 2460
CTCCACTGAC GTAAGGGATG ACGCACAATC CCACTATCCT TCGCAAGACC CTTCCTCTAT 2520
ATAAGGAAGT TCATTTCATT TGGAGAGGAC CTCGACCACG GTTCTGCTAC TTGTTCTTTG 2580
TTTTTCACCA ACAAAATGTC AAGTTCTATC GATTTGCTGA AGTTGATTGC TGAGAAGGGT 2640
GCTGACAGCC AGAGTGCCCA AGACATCGTA GACAATCAGG TTGCGCAACA GTTATCTGCG 2700
CAGATTGAAT ACGCGAAAAG GTCTAAGAAA ATCAACGTTC GCAATAAGCT CTCTATTGAG 2760
GAGGCTGACG CCTTCCGTGA CCGTTATGGT GGTGCCTTTG ACTTAAATTT GACTCAGCAG 2820
TATCATGCGC CCCATAGCCT GGCTGGTGCT CTGCGTGTAG CGGAGCATTA TGACTGTCTC 2880
GACAGTTTTC CCCCTGAAGA CCCCGTTATA GATTTCGGAG GGTCTTGGTG GCATCACTTT 2940
TCAAGAAGGG ATAAAAGGGT GCACAGTTGT TGTCCTGTGT TGGGTGTTAG AGACGCTGCC 3000
CGACATGAGG AGAGGATGTG CCGCATGCGA AAAATTTTGC AAGAAAGCGA TGATTTCGAT 3060
GAAGTCCCGA ACTTTTGTCT TAACCGAGCT CAAGATTGTG ATGTCCAAGC TGATTGGGCT 3120
ATCTGTATCC ACGGCGGTTA TGATATGGGC TTCCAAGGTC TGTGTGACGC CATGCATTCG 3180
CATGGAGTAC GCGTACTACG TGGTACCGTT ATGTTCGACG GCGCCATGTT GTTTGACCGC 3240
GAGGGTTTTC TTCCCTTGCT TAAATGTCAC TGGCAACGTG ACGGGTCAGG CGCGGATGAG 3300
GTGATCAAAT TCGATTTTGA AAATGAAAGC ACATTATCTT ACATCCACGG ATGGCAAGAT 3360
TTGGGCTCAT TTTTCACCGA GTCGGTGCAT TGCATCGATG GAACCAGCTA TCTGTTGGAG 3420
CGCGAAATGC TGAAATGTAA CATCATGACC TATAAGATCA TCGCTACAAA TTTACGCTGC 3480
CCCCGGGAGA CACTACGTCA CTGTGTATGG TTTGAAGACA TATCTAAGTA CGTAGGGGTC 3540
TCAATACCTG AAGACTGGAG TCTCAATCGC TGGAAATGTG TGCGCGTCGC CAAAACCACA 3600
GTGAGAGAGG TAGAGGAGAT AGCTTTCAGA TGTTTCAAGG AAAGTAAAGA ATGGACTGAG 3660
AACATGAAAG CTGTCGCATC TATCTTATCC GCCAAGTCGT CGACTGTTAT TATTAACGGT 3720
CAGGCTATCA TGGCTGGTGA GCGCTTAGAC ATTGAAGATT ATCATCTAGT GGCCTTTGCT 3780
TTGACTTTGA ATCTGTATCA AAAGTACGAA AAGCTTACGG CCCTCCGCGA TGGGATGGAA 3840
TGGAAAGGTT GGTGCCATCA CTTCAAAACT AGGTTTTGGT GGGGTGGAGA TTCATCCAGG 3900
GCGAAAGTAG GATGGCTGAG AACATTGGCT AGCAGATTTC CCCTACTACG TCTGGATTCT 3960
TATGCGGACA GTTTTAAGTT TCTGACTCGT CTCTCAAACG TTGAAGAATT TGAGCAAGAT 4020
TCTGTACCGA TATCACGTTT GAGAACGTTT TGGACTGAAG AGGACTTATT CGACCGGCTG 4080
GAGCATGAAG TGCAGACAGC CAAGACCAAG CGCTCGAAGA AGAAGGCGAA AGTCCCGCCA 4140
GCTGCTGAGA TACCTCAGGA GGAGTTTCAT GATGCCCCTG AGAGTTCGAG CCCTGAGTCC 4200
GTCAGTGATG ACGTTAAACC GGTGACTGAT GTGGTCCCGG ATGCCGAGGT GTCTGTTGAG 4260
GTACCAACGG ACCCTCGTGG CATATCTAGA CACGGAGCCA TGAAGGAATT TGTGCGTTAT 4320
TGTAAGAGAT TACATAACAA CTCCGAGTCT AATCTTCGTC ACCTATGGGA CATTTCCGGC 4380
GGTCGCGGAA GTGAGATCGC AAATAAGAGC ATCTTTGAGA CCTACCATCG CATAGACGAT 4440
ATGGTGAATG TCCATTTGGC CAACGGTAAC TGGTTGTATC CTAAAAAATA CGATTACACT 4500
GTTGGATATA ATGAGCATGG TTTAGGTCCG AAGCACGCAG ATGAAACGTA CATTGTTGAT 4560
AAAACATGTG CATGCTCTAA CTTGAGGGAC ATTGCAGAAG CTAGCGCCAA AGTTTCTGTC 4620
CCTACATGCG ATATTTCCAT GGTTGATGGA GTTGCGGGAT GCGGTAAAAC CACTGCCATA 4680
AAAGATGCAT TCCGTATGGG AGAGGACCTA ATTGTGACGG CGAATCGTAA ATCGGCCGAG 4740
GACGTCAGGA TGGCTTTATT CCCTGACACT TATAATTCCA AGGTAGCTTT GGACGTTGTG 4800
CGCACCGCGG ATTCTGCGAT CATGCACGGT GTACCGTCCT GTCATAGGCT GCTTGTTGAT 4860
GAGGCTGGTT TACTACATTA TGGTCAACTC CTGGTGGTGG CTGCTCTGTC TAAATGTTCA 4920
CAAGTTCTTG CCTTTGGGGA CACAGAGCAG ATTTCGTTCA AGTCTCGTGA CGCGGGTTTT 4980
AAATTGCTCC ACGGTAATCT GCAATATGAT CGCCGTGACG TTGTTCACAA GACTTACCGG 5040
TGTCCGCAAG ATGTTATCGC TGCTGTTAAT CTGCTGAAGC GTAAATGCGG TAATAGGGAC 5100
ACGAAGTATC AATCCTGGAC ATCTGAGTCC AAAGTTTCTA GAAGTCTCAC GAAGCGTCGT 5160
ATTACTTCTG GTTTGCAGGT CACTATTGAT CCGAACAGAA CGTATCTTAC GATGACTCAA 5220
GCTGATAAAG CGGCCCTTCA AACGAGGGCT AAGGATTTTC CCGTGAGCAA GGACTGGATT 5280
GATGGACACA TAAAAACAGT ACACGAAGCG CAAGGGATCT CTGTTGACAA CGTCACTTTG 5340
GTTCGGCTTA AGTCGACCAA ATGTGATTTG TTTAAACATG AGGAGTACTG TTTGGTTGCC 5400
TTAACACGAC ACAAGAAGTC CTTTGAGTAT TGCTTTAACG GCGAGCTCGC TGGTGATTTG 5460
ATCTTTAATT GTGTTAAGTG ATGCGCTTGT CTCTGTGTGA GACCTCTGCT CGAGAATTCG 5520
AGCTCGGTAC CCGGGGATCC TCTAGAGTCC GCAAATCACC AGTCTCTCTC TACAAATCTA 5580
TCTCTCTCTA TTTTCTCCAG AATAATGTGT GAGTAGTTCC CAGATAAGGG AATTAGGGTT 5640
CTTATAGGGT TTCGCTCATG TGTTGAGCAT ATAAGAAACC CTTAGTATGT ATTTGTATTT 5700
GTAAAATACT TCTATCAATA AAATTTCTAA TTCCTAAAAC CAAAATCCAG TGACCGGGTG 5760
GTCAGTCCCT TATGTTACGT CCTGTAGAAA CCCCAACCCG TGAAATCAAA AAACTCGACG 5820
GCCTGTGGGC ATTCAGTCTG GATCGCGAAA ACTGTGGAAT TGATCAGCGT TGGTGGGAAA 5880
GCGCGTTACA AGAAAGCCGG GCAATTGCTG TGCCAGGCAG TTTTAACGAT CAGTTCGCCG 5940
ATGCAGATAT TCGTAATTAT GCGGGCAACG TCTGGTATCA GCGCGAAGTC TTTATACCGA 6000
AAGGTTGGGC AGGCCAGCGT ATCGTGCTGC GTTTCGATGC GGTCACTCAT TACGGCAAAG 6060
TGTGGGTCAA TAATCAGGAA GTGATGGAGC ATCAGGGCGG CTATACGCCA TTTGAAGCCG 6120
ATGTCACGCC GTATGTTATT GCCGGGAAAA GTGTACAATT CACTGGCCGT CGTTTTACAA 6180
CGTCGTGACT GGGAAAACCC TGGCGTTACC CAACTTAATC GCCTTGCAGC ACATCCCCCT 6240
TTCGCCAGCT GGCGTAATAG CGAAGAGGCC CGCACCGATC GCCCTTCCCA ACAGTTGCGC 6300
AGCCTGAATG GCGAATGNNN NNNNAATTCA GTACATTAAA AACGTCCGCA ATGTGTTATT 6360
AAGTTGTCTA AGCGTCAATT TGTTTACACC ACAATATATC CTGCCACCAG CCAGCCAACA 6420
GCTCCCCGAC CGGCAGCTCG GCACAAAATC ACCACTCGAT ACAGGCAGCC CATCAGNNNN 6480
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 6540
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 6600
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 6660
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 6720
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 6750

<210> 3 
<211> 6426 
<212> DNA 

<213> Brome mosaic virus 

<400> 3

AAACACTGAT AGTTTAAACT GAAGGCGGGA AACGACAATC TGATCATGAG CGGAGAATTA 60
AGGGAGTCAC GTTATGACCC CCGCCGATGA CGCGGGACAA GCCGTTTTAC GTTTGGAACT 120
GACAGAACCG CAACGATTGA AGGAGCCACT CAGCCGCGGG TTTCTGGAGT TTAATGAGCT 180
AAGCACATAC GTCAGAAACC ATTATTGCGC GTTCAAAAGT CGCCTAAGGT CACTATCAGC 240
TAGCAAATAT TTCTTGTCAA AAATGCTCCA CTGACGTTCC ATAAATTCCC CTCGGTATCC 300
AATTAGNNNN NNNNNNNNNN NNNNNNNNNN GATCGTTTCG CATGATTGAA CAAGATGGAT 360
TGCACGCAGG TTCTCCGGCC GCTTGGGTGG AGAGGCTATT CGGCTATGAC TGGGCACAAC 420
AGACAATCGG CTGCTCTGAT GCCGCCGTGT TCCGGCTGTC AGCGCAGGGG CGCCCGGTTC 480
TTTTTGTCAA GACCGACCTG TCCGGTGCCC TGAATGAACT GCAGGACGAG GCAGCGCGGC 540
TATCGTGGCT GGCCACGACG GGCGTTCCTT GCGCAGCTGT GCTCGACGTT GTCACTGAAG 600
CGGGAAGGGA CTGGCTGCTA TTGGGCGAAG TGCCGGGGCA GGATCTCCTG TCATCTCACC 660
TTGCTCCTGC CGAGAAAGTA TCCATCATGG CTGATGCAAT GCGGCGGCTG CATACGCTTG 720
ATCCGGCTAC CTGCCCATTC GACCACCAAG CGAAACATCG CATCGAGCGA GCACGTACTC 780
GGATGGAAGC CGGTCTTGTC GATCAGGATG ATCTGGACGA AGAGCATCAG GGGCTCGCGC 840
CAGCCGAACT GTTCGCCAGG CTCAAGGCGC GCATGCCCGA CGGCGATGAT CTCGTCGTGA 900
CCCATGGCGA TGCCTGCTTG CCGAATATCA TGGTGGAAAA TGGCCGCTTT TCTGGATTCA 960
TCGACTGTGG CCGGCTGGGT GTGGCGGACC GCTATCAGGA CATAGCGTTG GCTACCCGTG 1020
ATATTGCTGA AGAGCTTGGC GGCGAATGGG CTGACCGCTT CCTCGTGCTT TACGGTATCG 1080
CCGCTCCCGA TTCGCAGCGC ATCGCCTTCT ATCGCCTTCT TGACGAGTTC TTCTGANNNN 1140
NNNNNNNNNN NNNNNNNNNN GATCGTTCAA ACATTTGGCA ATAAAGTTTC TTAAGATTGA 1200
ATCCTGTTGC CGGTCTTGCG ATGATTATCA TATAATTTCT GTTGAATTAC GTTAAGCATG 1260
TAATAATTAA CATGTAATGC ATGACGTTAT TTATGAGATG GGTTTTTATG ATTAGAGTCC 1320
CGCAATTATA CATTTAATAC GCGATAGAAA ACAAAATATA GCGCGCAAAC TAGGATAAAT 1380
TATCGCGCGC GGTGTCATCT ATGTTACTAG ATCGGGCCTC CTGTCAATGC TGGCGGCGGC 1440
TCTGGTGGTG GTTCTGGTGG CGGCTCTGAG GGTGGTGGCT CTGAGGGTGG CGGTTCTGAG 1500
GGTGGCGGCT CTGAGGGAGG CGGTTCCGGT GGTGGCTCTG GTTCCGGTGA TTTTGATTAT 1560
GAAAAGATGG CAAACGCTAA TAAGGGGGCT ATGACCGAAA ATGCCGATGA AAACGCGCTA 1620
CAGTCTGACG CTAAAGGCAA ACTTGATTCT GTCGCTACTG ATTACGGTGC TGCTATCGAT 1680
GGTTTCATTG GTGACGTTTC CGGCCTTGCT AATGGTAATG GTGCTACTGG TGATTTTGCT 1740
GGCTCTAATT CCCAAATGGC TCAAGTCGGT GACGGTGATA ATTCACCTTT AATGAATAAT 1800
TTCCGTCAAT ATTTACCTTC CCTCCCTCAA TCGGTTGAAT GTCGCCCTTT TGTCTTTGGC 1860
CCAATACGCA AACCGCCTCT CCCCGCGCGT TGGCCGATTC ATTAATGCAG CTGGCACGAC 1920
AGGTTTCCCG ACTGGAAAGC GGGCAGTGAG CGCAACGCAA TTAATGTGAG TTAGCTCACT 1980
CATTAGGCAC CCCAGGCTTT ACACTTTATG CTTCCGGCTC GTATGTTGTG TGGAATTGTG 2040
AGCGGATAAC AATTTCACAC AGGAAACAGC TATGACCATG ATTACGCCAA GCTTGCATGC 2100
CTGCAGGTCA CTGGATTTTG GTTTTAGGAA TTAGAAATTT TATTGATAGA AGTATTTTAC 2160
AAATACAAAT ACATACTAAG GGTTTCTTAT ATGCTCAACA CATGAGCGAA ACCCTATAAG 2220
AACCCTAATT CCCTTATCTG GGAACTACTC ACACATTATT CTGGAGAAAA TAGAGAGAGA 2280
TAGATTTGTA GAGAGAGACT GGTGATTTGC GGACTCTAGA GGATCCCCAG CTTTTAAACT 2340
TAGCCAAAGT GGTCTGCCTG ACCAGGAGTT TTTAACCTTA ACCAAAGGGC TGTTCACAGC 2400
TTAGGTTCAT ATATCATAGA ACCGATCATC TCAGATCAGA GGGCTTAAAA GTCTCACAAT 2460
GGGACTTCAC GAGCAAAGCA TCAACTGACG TTAGGCCTCC TCTACCGGTA GCGTAATCGT 2520
CGACCTTCTT TTTCAAGCGT TGTGTGGTCC TACGATCATT AGCTAATTTG AGTGACTCAC 2580
GCTCAAGGGC CTCATGTAAA CGTCCGATCC GTTTGACAGG GAGCTCCTTA GTACTACAGT 2640
CCGAGGAATA AATTCCAATG GTTCTGTAGA CTTTGTCTAA CACACCAGGA AACTTTGGAT 2700
TCTTCCAGTT GTGAAACCAG TCACCATCAG TTTTACGCTC TTCCGTGGTG CGTTTGAACT 2760
TACATACAGG ATCGCTCATC TGATAAACTC TGATGCCTTC GGTACAGTAG CAATCAGAGA 2820
ACCTCAGGAA ATTCTCGGAG TATAAAGAAA AAGCCGCAAG AGCAGCTCTA ACCTCCTCGA 2880
AAATCCAAGG TTTTTCTTTC CCATATTTCA GATAAACAAA ATGACAGAGC GTCGTAATCA 2940
TCTTCTCATC AAGTTGATTA ATAAACTTCA TTCGATCACA GAAGGAAACG AAATGTGCTC 3000
TGAGCATCTG TTCATCACGC AGAATCTTTC GCTTAGCTAA GCGCTGGATC TCTCTCAGAG 3060
GATCTGGTAC AGACACCAAA TTGCCCATTT CAGTTTCGAC GAGAAACTTA CTACAAACGT 3120
AGGGCACACT AGGGTCCATG ACTTTTATCT CCATATTGAA GAGAGACGTA AACATATCGG 3180
TATCCAGGAC TGGCTTAACT TTAGAGATGA TTAAAGAATC ATCTCCTGAA AATATTGCAC 3240
AGTCACAGTC ACTTAGATCA GAGGCATATG CAATCATAGC CATAGTGACA AGAGTATTAC 3300
CGAAATATGT AAACGCGTCA CCAGTTCTGC GTTGGAAGGA AACGGACATT CCCACCTTGG 3360
CATGAGGGTC TGATAAATAA GAATCGCGAT GAAAATCAGA CCACCAATTC GTCAGCGGCG 3420
CTGGAAAGCC CAGCGCAAGG AGTATCTCTC TCTGAAACTC TAGGTGCAGC TCACCCTGAG 3480
ATTTATCAAA TTTGCTTAGG TCCGCTTCAA GAAAGTATCT GTTATTCAAG CGGACATTCT 3540
TAAGCTCCAG AGAGGATATC TTTCCGATAG GCACAATGAA CCTGGATTTC AGGGCCAGTG 3600
ATAACTTCTC GAAACAAGCA GTGAAAAAGG GTGAAAAATT ACTAGTCACA CCTTTACTAT 3660
GAAATGTTAT AGTAGCTGCT ACTGCTCGTT CCAAGTGAAG GGTGTCAGTT ACAACAGGTT 3720
TTACGTCAGA CTTCAGCATA TGCTGGTACC GACATAAATC AGTCTCTGCT GCCACATTCA 3780
CACCTTGCAA GTCCATGTGC TTACCCCACT TCTTATGGTA CTCAAGACAT TTAGTCATGA 3840
CATCCATAGA AGCTCTCAGA CAGTCTTCAC CGTCAACATT AAGGAATGTG CTACGAAAGC 3900
GCTTTGCTAT AGCTTTCGCA GTGTCCTTCA TGTTAATCGC GTCTCCCATT TCTGGAACGT 3960
CCGCGTTTCG CTTTTTGAGT GCGGTTAAGA CTTCTTTCTG AGTACCAACT CTTCGCTGAG 4020
CACTCCCGAT ATTCATTTTT GGTTGAAAAT ATTTATCGGG GTCCCTATAC CAGTCTACAT 4080
CACTTTGCTT AAGTCTGATC CTATCAAAGT CCATGGAATA ATCACCATTT TCAACAAGGG 4140
CTTGATGGTA CGAATCATCG AAATAAGCAT GGGTTGGCAG TATGGAATGA CTGGTCGCTT 4200
CTGTTCTAGC AAGGCTGACT CTCTCCATAT AAATTGGCCC AGTAGAGATG TCAGGGTTAT 4260
CTGGATGGCA GTGTGTATCA ATAACACGCG AAACCCTATG TTCAATAGGG TTCATGATTT 4320
GAAGAGTGAT GTCGTAATCA GTATTAGTAG TCTGAAACTC TTCATCAATG CCCATGTACC 4380
TATCTCCAAG GGTCAGCTCC TTGGGGGTAT CTCCAGTAAC ACGAACTTCC TCAATTTCAC 4440
AGTTCGAGGA ATCACTGGCG AGTTTTAGAT CGCTCGCATG ATCTTCATCG GCGGCAAACG 4500
ATACACCGTA ACCATCACTA GTATCCTCGG GATACCAGTC ATCAATTTCA TCTTCGAGCA 4560
CGAAAGAGCC CGGAATGTCA AGATATAACA TCCGTGCCAT TTCAGCTTGA GGAATCAGCG 4620
GTCTATCGGT GAACTGTTGA ACCATTTGTT GGACGGTGTC GCAAATAGAG CCCCAGCGCA 4680
CTCGGTCAAA AGGGGGATCG AATACCCCTC CTATCTCCAA GGGCGCTATA GCTAATTTAA 4740
AACTCGCGAG AGATCCGTCA ATGGCAACTC CGTCTGCCGG CTCCTGCACC TGAAGGCTAG 4800
CAGCCTCCAC CTCGTCTTCT AAGGATTGAT CTATGATCCA TTGGAAAGAC GGGACCTGGC 4860
GAACGAAATC ATCATCCCAG GTTTTCGAAG ACATCTTGGT GATAGTAGAA AGAACAAGCA 4920
CACAACAACA ACAAGGTCAG ATGTGTGTTG CGGGTACCGA GCTCGAATTC TCGAGGTCCT 4980
CTCCAAATGA AATGAACTTC CTTATATAGA GGAAGGGTCT TGCGAAGGAT AGTGGGATTG 5040
TGCGTCATCC CTTACGTCAG TGGAGATATC ACATCAATCC ACTTGCTTTG AAGACGTGGT 5100
TGGAACGTCT TCTTTTTCCA CGATGTTCCT CGTGGGTGGG GGTCCATCTT TGGGACCACT 5160
GTCGGTAGAG GCATTCTTGA ACGATAGCCT TTCCTTTATC GCAATGATGG CATTTGTAGA 5220
AGCCATCTTC CTTTTCTACT GTCCTTTCGA TGAAGTGACA GATAGCTGGG CAATGGAATC 5280
CGAGGAGGTT TCCCGATATT ACCCTTTGTT GAAAAGTCTC AATAGCCCTC TGGTCTTCTG 5340
AGACTGTATC TTTGATATTC TTGGAGTAGA CGAGAGTGTC GTGCTCCACC ATGTTGACCT 5400
GCAGGCAGCA AGCTTGCATG CCTGCAGGTC GACTCTAGAG GATCCCCGGG TGGTCAGTCC 5460
CTTATGTTAC GTCCTGTAGA AACCCCAACC CGTGAAATCA AAAAACTCGA CGGCCTGTGG 5520
GCATTCAGTC TGGATCGCGA AAACTGTGGA ATTGATCAGC GTTGGTGGGA AAGCGCGTTA 5580
CAAGAAAGCC GGGCAATTGC TGTGCCAGGC AGTTTTAACG ATCAGTTCGC CGATGCAGAT 5640
ATTCGTAATT ATGCGGGCAA CGTCTGGTAT CAGCGCGAAG TCTTTATACC GAAAGGTTGG 5700
GCAGGCCAGC GTATCGTGCT GCGTTTCGAT GCGGTCACTC ATTACGGCAA AGTGTGGGTC 5760
AATAATCAGG AAGTGATGGA GCATCAGGGC GGCTATACGC CATTTGAAGC CGATGTCACG 5820
CCGTATGTTA TTGCCGGGAA AAGTGTACAA TTCACTGGCC GTCGTTTTAC AACGTCGTGA 5880
CTGGGAAAAC CCTGGCGTTA CCCAACTTAA TCGCCTTGCA GCACATCCCC CTTTCGCCAG 5940
CTGGCGTAAT AGCGAAGAGG CCCGCACCGA TCGCCCTTCC CAACAGTTGC GCAGCCTGAA 6000
TGGCGAATGN NNNNNNAATT CAGTACATTA AAAACGTCCG CAATGTGTTA TTAAGTTGTC 6060
TAAGCGTCAA TTTGTTTACA CCACAATATA TCCTGCCACC AGCCAGCCAA CAGCTCCCCG 6120
ACCGGCAGCT CGGCACAAAA TCACCACTCG ATACAGGCAG CCCATCAGNN NNNNNNNNNN 6180
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 6240
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 6300
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 6360
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 6420
NNNNNN 6426

<210> 4 
<211> 6500
<212> DNA 

<213> Brome mosaic virus 
<400> 4 

AAACACTGAT AGTTTAAACT GAAGGCGGGA AACGACAATC TGATCATGAG CGGAGAATTA 60
AGGGAGTCAC GTTATGACCC CCGCCGATGA CGCGGGACAA GCCGTTTTAC GTTTGGAACT 120
GACAGAACCG CAACGATTGA AGGAGCCACT CAGCCGCGGG TTTCTGGAGT TTAATGAGCT 180
AAGCACATAC GTCAGAAACC ATTATTGCGC GTTCAAAAGT CGCCTAAGGT CACTATCAGC 240
TAGCAAATAT TTCTTGTCAA AAATGCTCCA CTGACGTTCC ATAAATTCCC CTCGGTATCC 300
AATTAGNNNN NNNNNNNNNN NNNNNNNNNN GATCGTTTCG CATGATTGAA CAAGATGGAT 360
TGCACGCAGG TTCTCCGGCC GCTTGGGTGG AGAGGCTATT CGGCTATGAC TGGGCACAAC 420
AGACAATCGG CTGCTCTGAT GCCGCCGTGT TCCGGCTGTC AGCGCAGGGG CGCCCGGTTC 480
TTTTTGTCAA GACCGACCTG TCCGGTGCCC TGAATGAACT GCAGGACGAG GCAGCGCGGC 540
TATCGTGGCT GGCCACGACG GGCGTTCCTT GCGCAGCTGT GCTCGACGTT GTCACTGAAG 600
CGGGAAGGGA CTGGCTGCTA TTGGGCGAAG TGCCGGGGCA GGATCTCCTG TCATCTCACC 660
TTGCTCCTGC CGAGAAAGTA TCCATCATGG CTGATGCAAT GCGGCGGCTG CATACGCTTG 720
ATCCGGCTAC CTGCCCATTC GACCACCAAG CGAAACATCG CATCGAGCGA GCACGTACTC 780
GGATGGAAGC CGGTCTTGTC GATCAGGATG ATCTGGACGA AGAGCATCAG GGGCTCGCGC 840
CAGCCGAACT GTTCGCCAGG CTCAAGGCGC GCATGCCCGA CGGCGATGAT CTGGTCGTGA 900
CCCATGGCGA TGCCTGCTTG CCGAATATCA TGGTGGAAAA TGGCCGCTTT TCTGGATTCA 960



WO 99/61597 



PCT/US99/11250 



TCGACTGTGG CCGGCTGGGT GTGGCGGACC GCTATCAGGA CATAGCGTTG GCTACCCGTG 1020
ATATTGCTGA AGAGCTTGGC GGCGAATGGG CTGACCGCTT CCTCGTGCTT TACGGTATCG 1080
CCGCTCCCGA TTCGCAGCGC ATCGCCTTCT ATCGCCTTCT TGACGAGTTC TTCTGANNNN 1140
NNNNNNNNNN NNNNNNNNNN GATCGTTCAA ACATTTGGCA ATAAAGTTTC TTAAGATTGA 1200
ATCCTGTTGC CGGTCTTGCG ATGATTATCA TATAATTTCT GTTGAATTAC GTTAAGCATG 1260
TAATAATTAA CATGTAATGC ATGACGTTAT TTATGAGATG GGTTTTTATG ATTAGAGTCC 1320
CGCAATTATA CATTTAATAC GCGATAGAAA ACAAAATATA GCGCGCAAAC TAGGATAAAT 1380
TATCGCGCGC GGTGTCATCT ATGTTACTAG ATCGGGCCTC CTGTCAATGC TGGCGGCGGC 1440
TCTGGTGGTG GTTCTGGTGG CGGCTCTGAG GGTGGTGGCT CTGAGGGTGG CGGTTCTGAG 1500
GGTGGCGGCT CTGAGGGAGG CGGTTCCGGT GGTGGCTCTG GTTCCGGTGA TTTTGATTAT 1560
GAAAAGATGG CAAACGCTAA TAAGGGGGCT ATGACCGAAA ATGCCGATGA AAACGCGCTA 1620
CAGTCTGACG CTAAAGGCAA ACTTGATTCT GTCGCTACTG ATTACGGTGC TGCTATCGAT 1680
GGTTTCATTG GTGACGTTTC CGGCCTTGCT AATGGTAATG GTGCTACTGG TGATTTTGCT 1740
GGCTCTAATT CCCAAATGGC TCAAGTCGGT GACGGTGATA ATTCACCTTT AATGAATAAT 1800
TTCCGTCAAT ATTTACCTTC CCTCCCTCAA TCGGTTGAAT GTCGCCCTTT TGTCTTTGGC 1860
CCAATACGCA AACCGCCTCT CCCCGCGCGT TGGCCGATTC ATTAATGCAG CTGGCACGAC 1920
AGGTTTCCCG ACTGGAAAGC GGGCAGTGAG CGCAACGCAA TTAATGTGAG TTAGCTCACT 1980
CATTAGGCAC CCCAGGCTTT ACACTTTATG CTTCCGGCTC GTATGTTGTG TGGAATTGTG 2040
AGCGGATAAC AATTTCACAC AGGAAACAGC TATGACCATG ATTACGCCAA GCTTGCTGCC 2100
TGCAGGTCAA CATGGTGGAG CACGACACTC TCGTCTACTC CAAGAATATC AAAGATACAG 2160
TCTCAGAAGA CCAGAGGGCT ATTGAGACTT TTCAACAAAG GGTAATATCG GGAAACCTCC 2220
TCGGATTCCA TTGCCCAGCT ATCTGTCACT TCATCGAAAG GACAGTAGAA AAGGAAGATG 2280
GCTTCTACAA ATGCCATCAT TGCGATAAAG GAAAGGCTAT CGTTCAAGAA TGCCTCTACC 2340
GACAGTGGTC CCAAAGATGG ACCCCCACCC ACGAGGAACA TCGTGGAAAA AGAAGACGTT 2400
CCAACCACGT CTTCAAAGCA AGTGGATTGA TGTGATATCT CCACTGACGT AAGGGATGAC 2460
GCACAATCCC ACTATCCTTC GCAAGACCCT TCCTCTATAT AAGGAAGTTC ATTTCATTTG 2520
GAGAGGACCT CGAGAATTCG AGCTCGGTAC CCGCAACACA CATCTGACCT TGTTGTTGTT 2580
GTGTGCTTGT TCTTTCTACT ATCACCAAGA TGTCTTCGAA AACCTGGGAT GATGATTTCG 2640
TTCGCCAGGT CCCGTCTTTC CAATGGATCA TAGATCAATC CTTAGAAGAC GAGGTGGAGG 2700
CTGCTAGCCT TCAGGTGCAG GAGCCGGCAG ACGGAGTTGC CATTGACGGA TCTCTCGCGA 2760
GTTTTAAATT AGCTATAGCG CCCTTGGAGA TAGGAGGGGT ATTCGATCCC CCTTTTGACC 2820
GAGTGCGCTG GGGCTCTATT TGCGACACCG TCCAACAAAT GGTTCAACAG TTCACCGATA 2880
GACCGCTGAT TCCTCAAGCT GAAATGGCAC GGATGTTATA TCTTGACATT CCGGGCTCTT 2940
TCGTGCTCGA AGATGAAATT GATGACTGGT ATCCCGAGGA TACTAGTGAT GGTTACGGTG 3000
TATCGTTTGC CGCCGATGAA GATCATGCGA GCGATCTAAA ACTCGCCAGT GATTCCTCGA 3060
ACTGTGAAAT TGAGGAAGTT CGTGTTACTG GAGATACCCC CAAGGAGCTG ACCCTTGGAG 3120
ATAGGTACAT GGGCATTGAT GAAGAGTTTC AGACTACTAA TACTGATTAC GACATCACTC 3180
TTCAAATCAT GAACCCTATT GAACATAGGG TTTCGCGTGT TATTGATACA CACTGCCATC 3240
CAGATAACCC TGACATCTCT ACTGGGCCAA TTTATATGGA GAGAGTCAGC CTTGCTAGAA 3300
CAGAAGCGAC CAGTCATTCC ATACTGCCAA CCCATGCTTA TTTCGATGAT TCGTACCATC 3360
AAGCCCTTGT TGAAAATGGT GATTATTCCA TGGACTTTGA TAGGATCAGA CTTAAGCAAA 3420
GTGATGTAGA CTGGTATAGG GACCCCGATA AATATTTTCA ACCAAAAATG AATATCGGGA 3480
GTGCTCAGCG AAGAGTTGGT ACTCAGAAAG AAGTCTTAAC CGCACTCAAA AAGCGAAACG 3540
CGGACGTTCC AGAAATGGGA GACGCGATTA ACATGAAGGA CACTGCGAAA GCTATAGCAA 3600
AGCGCTTTCG TAGCACATTC CTTAATGTTG ACGGTGAAGA CTGTCTGAGA GCTTCTATGG 3660
ATGTCATGAC TAAATGTCTT GAGTACCATA AGAAGTGGGG TAAGCACATG GACTTGCAAG 3720
GTGTGAATGT GGCAGCAGAG ACTGATTTAT GTCGGTACCA GCATATGCTG AAGTCTGACG 3780
TAAAACCTGT TGTAACTGAC ACCCTTCACT TGGAACGAGC AGTAGCAGCT ACTATAACAT 3840
TTCATAGTAA AGGTGTGACT AGTAATTTTT CACCCTTTTT CACTGCTTGT TTCGAGAAGT 3900
TATCACTGGC CCTGAAATCC AGGTTCATTG TGCCTATCGG AAAGATATCC TCTCTGGAGC 3960
TTAAGAATGT CCGCTTGAAT AACAGATACT TTCTTGAAGC GGACCTAAGC AAATTTGATA 4020
AATCTCAGGG TGAGCTGCAC CTAGAGTTTC AGAGAGAGAT ACTCCTTGCG CTGGGCTTTC 4080
CAGCGCCGCT GACGAATTGG TGGTCTGATT TTCATCGCGA TTCTTATTTA TCAGACCCTC 4140
ATGCCAAGGT GGGAATGTCC GTTTCCTTCC AACGCAGAAC TGGTGACGCG TTTACATATT 4200
TCGGTAATAC TCTTGTCACT ATGGCTATGA TTGCATATGC CTCTGATCTA AGTGACTGTG 4260
ACTGTGCAAT ATTTTCAGGA GATGATTCTT TAATCATCTC TAAAGTTAAG CCAGTCCTGG 4320
ATACCGATAT GTTTACGTCT CTCTTCAATA TGGAGATAAA AGTCATGGAC CCTAGTGTGC 4380
CCTACGTTTG TAGTAAGTTT CTCGTCGAAA CTGAAATGGG CAATTTGGTG TCTGTACCAG 4440
ATCCTCTGAG AGAGATCCAG CGCTTAGCTA AGCGAAAGAT TCTGCGTGAT GAACAGATGC 4500
TCAGAGCACA TTTCGTTTCC TTCTGTGATC GAATGAAGTT TATTAATCAA CTTGATGAGA 4560
AGATGATTAC GACGCTCTGT CATTTTGTTT ATCTGAAATA TGGGAAAGAA AAACCTTGGA 4620
TTTTCGAGGA GGTTAGAGCT GCTCTTGCGG CTTTTTCTTT ATACTCCGAG AATTTCCTGA 4680
GGTTCTCTGA TTGCTACTGT ACCGAAGGCA TCAGAGTTTA TCAGATGAGC GATCCTGTAT 4740
GTAAGTTCAA ACGCACCACG GAAGAGCGTA AAACTGATGG TGACTGGTTT CACAACTGGA 4800
AGAATCCAAA GTTTCCTGGT GTGTTAGACA AAGTCTACAG AACCATTGGA ATTTATTCCT 4860
CGGACTGTAG TACTAAGGAG CTCCCTGTCA AACGGATCGG ACGTTTACAT GAGGCCCTTG 4920
AGCGTGAGTC ACTCAAATTA GCTAATGATC GTAGGACCAC ACAACGCTTG AAAAAGAAGG 4980
TCGACGATTA CGCTACCGGT AGAGGAGGCC TAACGTCAGT TGATGCTTTG CTCGTGAAGT 5040
CCCATTGTGA GACTTTTAAG CCCTCTGATC TGAGATGATC GGTTCTATGA TATATGAACC 5100
TAAGCTGTGA ACAGCCCTTT GGTTAAGGTT AAAAACTCCT GGTCAGGCAG ACCACTTTGG 5160
CTAAGTTTAA AAGCTGGGGA TCCTCTAGAG TCCGCAAATC ACCAGTCTCT CTCTACAAAT 5220
CTATCTCTCT CTATTTTCTC CAGAATAATG TGTGAGTAGT TCCCAGATAA GGGAATTAGG 5280
GTTCTTATAG GGTTTCGCTC ATGTGTTGAG CATATAAGAA ACCCTTAGTA TGTATTTGTA 5340
TTTGTAAAAT ACTTCTATCA ATAAAATTTC TAATTCCTAA AACCAAAATC CAGTGACCTG 5400
CAGGCATGCA AGCTTGCATG CCTGCAGGTC GACTCTAGAG GATCCCCGGG TGGTCAGTCC 5460
CTTATGTTAC GTCCTGTAGA AACCCCAACC CGTGAAATCA AAAAACTCGA CGGCCTGTGG 5520
GCATTCAGTC TGGATCGCGA AAACTGTGGA ATTGATCAGC GTTGGTGGGA AAGCGCGTTA 5580
CAAGAAAGCC GGGCAATTGC TGTGCCAGGC AGTTTTAACG ATCAGTTCGC CGATGCAGAT 5640
ATTCGTAATT ATGCGGGCAA CGTCTGGTAT CAGCGCGAAG TCTTTATACC GAAAGGTTGG 5700
GCAGGCCAGC GTATCGTGCT GCGTTTCGAT GCGGTCACTC ATTACGGCAA AGTGTGGGTC 5760
AATAATCAGG AAGTGATGGA GCATCAGGGC GGCTATACGC CATTTGAAGC CGATGTCACG 5820



CCGTATGTTA TTGCCGGGAA AAGTGTACAA TTCACTGGCC GTCGTTTTAC AACGTCGTGA 5880
CTGGGAAAAC CCTGGCGTTA CCCAACTTAA TGGCCTTGCA GCACATCCCC CTTTCGCCAG 5940 
CTGGCGTAAT AGCGAAGAGG CCCGCACCGA TCGCCCTTCC CAACAGTTGC GCAGCCTGAA 6000 
TGGCGAATGN NNNNNNAATT CAGTACATTA AAAACGTCCG CAATGTGTTA TTAAGTTGTC 6060
TAAGCGTCAA TTTGTTTACA CCACAATATA TCCTGCCACC AGCCAGCCAA CAGCTCCCCG 6120
ACCGGCAGCT CGGCACAAAA TCACCACTCG ATACAGGCAG CCCATCAGNN NNNNNNNNNN 6180
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 6240
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 6300
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 6360
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 6420
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 6480
NNNNNNNNNN NNNNNNNNNN 6500



<210> 5 
<211> 10100
<212> DNA 

<213> Brome mosaic virus 
<400> 5 

AAACACTGAT AGTTTAAACT GAAGGCGGGA AACGACAATC TGATCATGAG CGGAGAATTA 60
AGGGAGTCAC GTTATGACCC CCGCCGATGA CGCGGGACAA GCCGTTTTAC GTTTGGAACT 120
GACAGAACCG CAACGATTGA AGGAGCCACT CAGCCGCGGG TTTCTGGAGT TTAATGAGCT 180
AAGCACATAC GTCAGAAACC ATTATTGCGC GTTCAAAAGT CGCCTAAGGT CACTATCAGC 240
TAGCAAATAT TTCTTGTCAA AAATGCTCCA CTGACGTTCC ATAAATTCCC CTCGGTATCC 300
AATTAGNNNN NNNNNNNNNN NNNNNNNNNN GATCGTTTCG CATGATTGAA CAAGATGGAT 360
TGCACGCAGG TTCTCCGGCC GCTTGGGTGG AGAGGCTATT CGGCTATGAC TGGGCACAAC 420
AGACAATCGG CTGCTCTGAT GCCGCCGTGT TCCGGCTGTC AGCGCAGGGG CGCCCGGTTC 480
TTTTTGTCAA GACCGACCTG TCCGGTGCCC TGAATGAACT GCAGGACGAG GCAGCGCGGC 540
TATCGTGGCT GGCCACGACG GGCGTTCCTT GCGCAGCTGT GCTCGACGTT GTCACTGAAG 600
CGGGAAGGGA CTGGCTGCTA TTGGGCGAAG TGCCGGGGCA GGATCTCCTG TCATCTCACC 660
TTGCTCCTGC CGAGAAAGTA TCCATCATGG CTGATGCAAT GCGGCGGCTG CATACGCTTG 720
ATCCGGCTAC CTGCCCATTC GACCACCAAG CGAAACATCG CATCGAGCGA GCACGTACTC 780
GGATGGAAGC CGGTCTTGTC GATCAGGATG ATCTGGACGA AGAGCATCAG GGGCTCGCGC 840
CAGCCGAACT GTTCGCCAGG CTCAAGGCGC GCATGCCCGA CGGCGATGAT CTCGTCGTGA 900
CCCATGGCGA TGCCTGCTTG CCGAATATCA TGGTGGAAAA TGGCCGCTTT TCTGGATTCA 960
TCGACTGTGG CCGGCTGGGT GTGGCGGACC GCTATCAGGA CATAGCGTTG GCTACCCGTG 1020
ATATTGCTGA AGAGCTTGGC GGCGAATGGG CTGACCGCTT CCTCGTGCTT TACGGTATCG 1080
CCGCTCCCGA TTCGCAGCGC ATCGCCTTCT ATCGCCTTCT TGACGAGTTC TTCTGANNNN 1140
NNNNNNNNNN NNNNNNNNNN GATCGTTCAA ACATTTGGCA ATAAAGTTTC TTAAGATTGA 1200
ATCCTGTTGC CGGTCTTGCG ATGATTATCA TATAATTTCT GTTGAATTAC GTTAAGCATG 1260
TAATAATTAA CATGTAATGC ATGACGTTAT TTATGAGATG GGTTTTTATG ATTAGAGTCC 1320
CGCAATTATA CATTTAATAC GCGATAGAAA ACAAAATATA GCGCGCAAAC TAGGATAAAT 1380
TATCGCGCGC GGTGTCATCT ATGTTACTAG ATCGGGCCTC CTGTCAATGC TGGCGGCGGC 1440
TCTGGTGGTG GTTCTGGTGG CGGCTCTGAG GGTGGTGGCT CTGAGGGTGG CGGTTCTGAG 1500
GGTGGCGGCT CTGAGGGAGG CGGTTCCGGT GGTGGCTCTG GTTCCGGTGA TTTTGATTAT 1560
GAAAAGATGG CAAACGCTAA TAAGGGGGCT ATGACCGAAA ATGCCGATGA AAACGCGCTA 1620
CAGTCTGACG CTAAAGGCAA ACTTGATTCT GTCGCTACTG ATTACGGTGC TGCTATCGAT 1680
GGTTTCATTG GTGACGTTTC CGGCCTTGCT AATGGTAATG GTGCTACTGG TGATTTTGCT 1740
GGCTCTAATT CCCAAATGGC TCAAGTCGGT GACGGTGATA ATTCACCTTT AATGAATAAT 1800
TTCCGTCAAT ATTTACCTTC CCTCCCTCAA TCGGTTGAAT GTCGCCCTTT TGTCTTTGGC 1860
CCAATACGCA AACCGCCTCT CCCCGCGCGT TGGCCGATTC ATTAATGCAG CTGGCACGAC 1920
AGGTTTCCCG ACTGGAAAGC GGGCAGTGAG CGCAACGCAA TTAATGTGAG TTAGCTCACT 1980
CATTAGGCAC CCCAGGCTTT ACACTTTATG CTTCCGGCTC GTATGTTGTG TGGAATTGTG 2040
AGCGGATAAC AATTTCACAC AGGAAACAGC TATGACCATG ATTACGCCAA GCTTGCATGC 2100
CTGCAGGTCA CTGGATTTTG GTTTTAGGAA TTAGAAATTT TATTGATAGA AGTATTTTAC 2160
AAATACAAAT ACATACTAAG GGTTTCTTAT ATGCTCAACA CATGAGCGAA ACCCTATAAG 2220
AACCCTAATT CCCTTATCTG GGAACTACTC ACACATTATT CTGGAGAAAA TAGAGAGAGA 2280
TAGATTTGTA GAGAGAGACT GGTGATTTGC GGACTCTAGA GGATCCCCAG CTTTTAAACT 2340
TAGCCAAAGT GGTCTGCCTG ACCAGGAGTT TTTAACCTTA ACCAAAGGGC TGTTCACAGC 2400
TTAGGTTCAT ATATCATAGA ACCGATCATC TCAGATCAGA GGGCTTAAAA GTCTCACAAT 2460
GGGACTTCAC GAGCAAAGCA TCAACTGACG TTAGGCCTCC TCTACCGGTA GCGTAATCGT 2520
CGACCTTCTT TTTCAAGCGT TGTGTGGTCC TACGATCATT AGCTAATTTG AGTGACTCAC 2580
GCTCAAGGGC CTCATGTAAA CGTCCGATCC GTTTGACAGG GAGCTCCTTA GTACTACAGT 2640
CCGAGGAATA AATTCCAATG GTTCTGTAGA CTTTGTCTAA CACACCAGGA AACTTTGGAT 2700
TCTTCCAGTT GTGAAACCAG TCACCATCAG TTTTACGCTC TTCCGTGGTG CGTTTGAACT 2760
TACATACAGG ATCGCTCATC TGATAAACTC TGATGCCTTC GGTACAGTAG CAATCAGAGA 2820
ACCTCAGGAA ATTCTCGGAG TATAAAGAAA AAGCCGCAAG AGCAGCTCTA ACCTCCTCGA 2880
AAATCCAAGG TTTTTCTTTC CCATATTTCA GATAAACAAA ATGACAGAGC GTCGTAATCA 2940
TCTTCTCATC AAGTTGATTA ATAAACTTCA TTCGATCACA GAAGGAAACG AAATGTGCTC 3000
TGAGCATCTG TTCATCACGC AGAATCTTTC GCTTAGCTAA GCGCTGGATC TCTCTCAGAG 3060
GATCTGGTAC AGACACCAAA TTGCCCATTT CAGTTTCGAC GAGAAACTTA CTACAAACGT 3120
AGGGCACACT AGGGTCCATG ACTTTTATCT CCATATTGAA GAGAGACGTA AACATATCGG 3180
TATCCAGGAC TGGCTTAACT TTAGAGATGA TTAAAGAATC ATCTCCTGAA AATATTGCAC 3240
AGTCACAGTC ACTTAGATCA GAGGCATATG CAATCATAGC CATAGTGACA AGAGTATTAC 3300
CGAAATATGT AAACGCGTCA CCAGTTCTGC GTTGGAAGGA AACGGACATT CCCACCTTGG 3360
CATGAGGGTC TGATAAATAA GAATCGCGAT GAAAATCAGA CCACCAATTC GTCAGCGGCG 3420
CTGGAAAGCC CAGCGCAAGG AGTATCTCTC TCTGAAACTC TAGGTGCAGC TCACCCTGAG 3480
ATTTATCAAA TTTGCTTAGG TCCGCTTCAA GAAAGTATCT GTTATTCAAG CGGACATTCT 3540
TAAGCTCCAG AGAGGATATC TTTCCGATAG GCACAATGAA CCTGGATTTC AGGGCCAGTG 3600
ATAACTTCTC GAAACAAGCA GTGAAAAAGG GTGAAAAATT ACTAGTCACA CCTTTACTAT 3660
GAAATGTTAT AGTAGCTGCT ACTGCTCGTT CCAAGTGAAG GGTGTCAGTT ACAACAGGTT 3720
TTACGTCAGA CTTCAGCATA TGCTGGTACC GACATAAATC AGTCTCTGCT GCCACATTCA 3780
CACCTTGCAA GTCCATGTGC TTACCCCACT TCTTATGGTA CTCAAGACAT TTAGTCATGA 3840
CATCCATAGA AGCTCTCAGA CAGTCTTCAC CGTCAACATT AAGGAATGTG CTACGAAAGC 3900
GCTTTGCTAT AGCTTTCGCA GTGTCCTTCA TGTTAATCGC GTCTCCCATT TCTGGAACGT 3960
CCGCGTTTCG CTTTTTGAGT GCGGTTAAGA CTTCTTTCTG AGTACCAACT CTTCGCTGAG 4020
CACTCCCGAT ATTCATTTTT GGTTGAAAAT ATTTATCGGG GTCCCTATAC CAGTCTACAT 4080
CACTTTGCTT AAGTCTGATC CTATCAAAGT CCATGGAATA ATCACCATTT TCAACAAGGG 4140
CTTGATGGTA CGAATCATCG AAATAAGCAT GGGTTGGCAG TATGGAATGA CTGGTCGCTT 4200
CTGTTCTAGC AAGGCTGACT CTCTCCATAT AAATTGGCCC AGTAGAGATG TCAGGGTTAT 4260
CTGGATGGCA GTGTGTATCA ATAACACGCG AAACCCTATG TTCAATAGGG TTCATGATTT 4320
GAAGAGTGAT GTCGTAATCA GTATTAGTAG TCTGAAACTC TTCATCAATG CCCATGTACC 4380
TATCTCCAAG GGTCAGCTCC TTGGGGGTAT CTCCAGTAAC ACGAACTTCC TCAATTTCAC 4440
AGTTCGAGGA ATCACTGGCG AGTTTTAGAT CGCTCGCATG ATCTTCATCG GCGGCAAACG 4500
ATACACCGTA ACCATCACTA GTATCCTCGG GATACCAGTC ATCAATTTCA TCTTCGAGCA 4560
CGAAAGAGCC CGGAATGTCA AGATATAACA TCCGTGCCAT TTCAGCTTGA GGAATCAGCG 4620
GTCTATCGGT GAACTGTTGA ACCATTTGTT GGACGGTGTC GCAAATAGAG CCCCAGCGCA 4680
CTCGGTCAAA AGGGGGATCG AATACCCCTC CTATCTCCAA GGGCGCTATA GCTAATTTAA 4740
AACTCGCGAG AGATCCGTCA ATGGCAACTC CGTCTGCCGG CTCCTGCACC TGAAGGCTAG 4800
CAGCCTCCAC CTCGTCTTCT AAGGATTGAT CTATGATCCA TTGGAAAGAC GGGACCTGGC 4860
GAACGAAATC ATCATCCCAG GTTTTCGAAG ACATCTTGGT GATAGTAGAA AGAACAAGCA 4920
CACAACAACA ACAAGGTCAG ATGTGTGTTG CGGGTACCGA GCTCGAATTC TCGAGGTCCT 4980
CTCCAAATGA AATGAACTTC CTTATATAGA GGAAGGGTCT TGCGAAGGAT AGTGGGATTG 5040
TGCGTCATCC CTTACGTCAG TGGAGATATC ACATCAATCC ACTTGCTTTG AAGACGTGGT 5100
TGGAACGTCT TCTTTTTCCA CGATGTTCCT CGTGGGTGGG GGTCCATCTT TGGGACCACT 5160
GTCGGTAGAG GCATTCTTGA ACGATAGCCT TTCCTTTATC GCAATGATGG CATTTGTAGA 5220
AGCCATCTTC CTTTTCTACT GTCCTTTCGA TGAAGTGACA GATAGCTGGG CAATGGAATC 5280
CGAGGAGGTT TCCCGATATT ACCCTTTGTT GAAAAGTCTC AATAGCCCTC TGGTCTTCTG 5340
AGACTGTATC TTTGATATTC TTGGAGTAGA CGAGAGTGTC GTGCTCCACC ATGTTGACCT 5400
GCAGGCAGCA AGCTTGCATG CCTGCAGGTC GACTCTAGAG GATCCCCGGT CAACATGGTG 5460
GAGCACGACA CTCTCGTCTA CTCCAAGAAT ATCAAAGATA CAGTCTCAGA AGACCAGAGG 5520
GCTATTGAGA CTTTTCAACA AAGGGTAATA TCGGGAAACC TCCTCGGATT CCATTGCCCA 5580






WO 99/61597 PCT/US99/11250 


GCTATCTGTC ACTTCATCGA AAGGACAGTA GAAAAGGAAG ATGGCTTCTA CAAATGCCAT 5640
CATTGCGATA AAGGAAAGGC TATCGTTCAA GAATGCCTCT ACCGACAGTG GTCCCAAAGA 5700
TGGACCCCCA CCCACGAGGA ACATCGTGGA AAAAGAAGAC GTTCCAACCA CGTCTTCAAA 5760
GCAAGTGGAT TGATGTGATA TCTCCACTGA CGTAAGGGAT GACGCACAAT CCCACTATCC 5820
TTCGCAAGAC CCTTCCTCTA TATAAGGAAG TTCATTTCAT TTGGAGAGGA CCTCGACCAC 5880
GGTTCTGCTA CTTGTTCTTT GTTTTTCACC AACAAAATGT CAAGTTCTAT CGATTTGCTG 5940
AAGTTGATTG CTGAGAAGGG TGCTGACAGC CAGAGTGCCC AAGACATCGT AGACAATCAG 6000
GTTGCGCAAC AGTTATCTGC GCAGATTGAA TACGCGAAAA GGTCTAAGAA AATCAACGTT 6060
CGCAATAAGC TCTCTATTGA GGAGGCTGAC GCCTTCCGTG ACCGTTATGG TGGTGCCTTT 6120
GACTTAAATT TGACTCAGCA GTATCATGCG CCCCATAGCC TGGCTGGTGC TCTGCGTGTA 6180
GCGGAGCATT ATGACTGTCT CGACAGTTTT CCCCCTGAAG ACCCCGTTAT AGATTTCGGA 6240
GGGTCTTGGT GGCATCACTT TTCAAGAAGG GATAAAAGGG TGCACAGTTG TTGTCCTGTG 6300
TTGGGTGTTA GAGACGCTGC CCGACATGAG GAGAGGATGT GCCGCATGCG AAAAATTTTG 6360
CAAGAAAGCG ATGATTTCGA TGAAGTCCCG AACTTTTGTC TTAACCGAGC TCAAGATTGT 6420
GATGTCCAAG CTGATTGGGC TATCTGTATC CACGGCGGTT ATGATATGGG CTTCCAAGGT 6480
CTGTGTGACG CCATGCATTC GCATGGAGTA CGCGTACTAC GTGGTACCGT TATGTTCGAC 6540
GGCGCCATGT TGTTTGACCG CGAGGGTTTT CTTCCCTTGC TTAAATGTCA CTGGCAACGT 6600
GACGGGTCAG GCGCGGATGA GGTGATCAAA TTCGATTTTG AAAATGAAAG CACATTATCT 6660
TACATCCACG GATGGCAAGA TTTGGGCTCA TTTTTCACCG AGTCGGTGCA TTGCATCGAT 6720
GGAACCACCT ATCTGTTGGA GCGCGAAATG CTGAAATGTA ACATCATGAC CTATAAGATC 6780
ATCGCTACAA ATTTACGCTG CCCCCGGGAG ACACTACGTC ACTGTGTATG GTTTGAAGAC 6840
ATATCTAAGT ACGTAGGGGT CTCAATACCT GAAGACTGGA GTCTCAATCG CTGGAAATGT 6900
GTGCGCGTCG CCAAAACCAC AGTGAGAGAG GTAGAGGAGA TAGCTTTCAG ATGTTTCAAG 6960
GAAAGTAAAG AATGGACTGA GAACATGAAA GCTGTCGCAT CTATCTTATC CGCCAAGTCG 7020
TCGACTGTTA TTATTAACGG TCAGGCTATC ATGGCTGGTG AGCGCTTAGA CATTGAAGAT 7080
TATCATCTAG TGGCCTTTGC TTTGACTTTG AATCTGTATC AAAAGTACGA AAAGCTTACG 7140
GCCCTCCGCG ATGGGATGGA ATGGAAAGGT TGGTGCCATC ACTTCAAAAC TAGGTTTTGG 7200
TGGGGTGGAG ATTCATCCAG GGCGAAAGTA GGATGGCTGA GAACATTGGC TAGCAGATTT 7260
CCCCTACTAC GTCTGGATTC TTATGCGGAC AGTTTTAAGT TTCTGACTCG TCTCTCAAAC 7320
GTTGAAGAAT TTGAGCAAGA TTCTGTACCG ATATGACGTT TGAGAACGTT TTGGACTGAA 7380
GAGGACTTAT TCGACCGGCT GGAGCATGAA GTGCAGACAG CCAAGACCAA GCGCTCGAAG 7440
AAGAAGGCGA AAGTCCCGCC AGCTGCTGAG ATACCTCAGG AGGAGTTTCA TGATGCCCCT 7500
GAGAGTTCGA GCCCTGAGTC CGTCAGTGAT GACGTTAAAC CGGTGACTGA TGTGGTCCCG 7560
GATGCCGAGG TGTCTGTTGA GGTACCAACG GACCCTCGTG GCATATCTAG ACACGGAGCC 7620
ATGAAGGAAT TTGTGCGTTA TTGTAAGAGA TTACATAACA ACTCCGAGTC TAATCTTCGT 7680
CACCTATGGG ACATTTCCGG CGGTCGCGGA AGTGAGATCG CAAATAAGAG CATCTTTGAG 7740
ACCTACCATC GCATAGACGA TATGGTGAAT GTCCATTTGG CCAACGGTAA CTGGTTGTAT 7800
CCTAAAAAAT ACGATTACAC TGTTGGATAT AATGAGCATG GTTTAGGTCC GAAGCACGCA 7860
GATGAAACGT ACATTGTTGA TAAAACATGT GCATGCTCTA ACTTGAGGGA CATTGCAGAA 7920
GCTAGCGCCA AAGTTTCTGT CCCTACATGC GATATTTCCA TGGTTGATGG AGTTGCGGGA 7980
TGCGGTAAAA CCACTGCCAT AAAAGATGCA TTCCGTATGG GAGAGGACCT AATTGTGACG 8040
GCGAATCGTA AATCGGCCGA GGACGTCAGG ATGGCTTTAT TCCCTGACAC TTATAATTCC 8100
AAGGTAGCTT TGGACGTTGT GCGCACCGCG GATTCTGCGA TCATGCACGG TGTACCGTCC 8160
TGTCATAGGC TGCTTGTTGA TGAGGCTGGT TTACTACATT ATGGTCAACT CCTGGTGGTG 8220
GCTGCTCTGT CTAAATGTTC ACAAGTTCTT GCCTTTGGGG ACACAGAGCA GATTTCGTTC 8280
AAGTCTCGTG ACGCGGGTTT TAAATTGCTC CACGGTAATC TGCAATATGA TCGCCGTGAC 8340
GTTGTTCACA AGACTTACCG GTGTCCGCAA GATGTTATCG CTGCTGTTAA TCTGCTGAAG 8400
CGTAAATGCG GTAATAGGGA CACGAAGTAT CAATCCTGGA CATCTGAGTC CAAAGTTTCT 8460
AGAAGTCTCA CGAAGCGTCG TATTACTTCT GGTTTGCAGG TCACTATTGA TCCGAACAGA 8520
ACGTATCTTA CGATGACTCA AGCTGATAAA GCGGCCCTTC AAACGAGGGC TAAGGATTTT 8580
CCCGTGAGCA AGGACTGGAT TGATGGACAC ATAAAAACAG TACACGAAGC GCAAGGGATC 8640
TCTGTTGACA ACGTCACTTT GGTTCGGCTT AAGTCGACCA AATGTGATTT GTTTAAACAT 8700
GAGGAGTACT GTTTGGTTGC CTTAACACGA CACAAGAAGT CCTTTGAGTA TTGCTTTAAC 8760
GGCGAGCTCG CTGGTGATTT GATCTTTAAT TGTGTTAAGT GATGCGCTTG TCTCTGTGTG 8820
AGACCTCTGC TCGAGAATTC GAGCTCGGTA CCCGGGGATC CTCTAGAGTC CGCAAATCAC 8880

CAGTCTCTCT CTACAAATCT ATCTCTCTCT ATTTTCTCCA GAATAATGTG TGAGTAGTTC 8940
CCAGATAAGG GAATTAGGGT TCTTATAGGG TTTCGCTCAT GTGTTGAGCA TATAAGAAAC 9000
CCTTAGTATG TATTTGTATT TGTAAAATAC TTCTATCAAT AAAATTTCTA ATTCCTAAAA 9060
CCAAAATCCA GTGACCGGGT GGTCAGTCCC TTATGTTACG TCCTGTAGAA ACCCCAACCC 9120
GTGAAATCAA AAAACTCGAC GGCCTGTGGG CATTCAGTCT GGATCGCGAA AACTGTGGAA 9180
TTGATCAGCG TTGGTGGGAA AGCGCGTTAC AAGAAAGCCG GGCAATTGCT GTGCCAGGCA 9240
GTTTTAACGA TCAGTTCGCC GATGCAGATA TTCGTAATTA TGCGGGCAAC GTCTGGTATC 9300
AGCGCGAAGT CTTTATACCG AAAGGTTGGG CAGGCCAGCG TATCGTGCTG CGTTTCGATG 9360
CGGTCACTCA TTACGGCAAA GTGTGGGTCA ATAATCAGGA AGTGATGGAG CATCAGGGCG 9420
GCTATACGCC ATTTGAAGCC GATGTCACGC CGTATGTTAT TGCCGGGAAA AGTGTACAAT 9480
TCACTGGCCG TCGTTTTACA ACGTCGTGAC TGGGAAAACC CTGGCGTTAC CCAACTTAAT 9540
CGCCTTGCAG CACATCCCCC TTTCGCCAGC TGGCGTAATA GCGAAGAGGC CCGCACCGAT 9600
CGCCCTTCCC AACAGTTGCG CAGCCTGAAT GGCGAATGNN NNNNNAATTC AGTACATTAA 9660
AAACGTCCGC AATGTGTTAT TAAGTTGTCT AAGCGTCAAT TTGTTTACAC CACAATATAT 9720
CCTGCCACCA GCCAGCCAAC AGCTCCCCGA CCGGCAGCTC GGCACAAAAT CACCACTCGA 9780
TACAGGCAGC CCATCAGNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 9840
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 9900
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 9960
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 10020
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 10080
NNNNNNNNNN NNNNNNNNNN 10100






<210> 6 
<211> 10240 
<212> DNA

<213> Brome mosaic virus 






<400> 6 

AAACACTGAT AGTTTAAACT GAAGGCGGGA AACGACAATC TGATCATGAG CGGAGAATTA 60 
AGGGAGTCAC GTTATGACCC CCGCCGATGA CGCGGGACAA GCCGTTTTAC GTTTGGAACT 120 




GACAGAACCG CAACGATTGA AGGAGCCACT CAGCCGCGGG TTTCTGGAGT TTAATGAGCT 180
AAGCACATAC GTCAGAAACC ATTATTGCGC GTTCAAAAGT CGCCTAAGGT CACTATCAGC 240
TAGCAAATAT TTCTTGTCAA AAATGCTCCA CTGACGTTCC ATAAATTCCC CTCGGTATCC 300
AATTAGNNNN NNNNNNNNNN NNNNNNNNNN GATCGTTTCG CATGATTGAA CAAGATGGAT 360
TGCACGCAGG TTCTCCGGCC GCTTGGGTGG AGAGGCTATT CGGCTATGAC TGGGCACAAC 420
AGACAATCGG CTGCTCTGAT GCCGCCGTGT TCCGGCTGTC AGCGCAGGGG CGCCCGGTTC 480
TTTTTGTCAA GACCGACCTG TCCGGTGCCC TGAATGAACT GCAGGACGAG GCAGCGCGGC 540
TATCGTGGCT GGCCACGACG GGCGTTCCTT GCGCAGCTGT GCTCGACGTT GTCACTGAAG 600
CGGGAAGGGA CTGGCTGCTA TTGGGCGAAG TGCCGGGGCA GGATCTCCTG TCATCTCACC 660
TTGCTCCTGC CGAGAAAGTA TCCATCATGG CTGATGCAAT GCGGCGGCTG CATACGCTTG 720
ATCCGGCTAC CTGCCCATTC GACCACCAAG CGAAACATCG CATCGAGCGA GCACGTACTC 780
GGATGGAAGC CGGTCTTGTC GATCAGGATG ATCTGGACGA AGAGCATCAG GGGCTCGCGC 840
CAGCCGAACT GTTCGCCAGG CTCAAGGCGC GCATGCCCGA CGGCGATGAT CTCGTCGTGA 900
CCCATGGCGA TGCCTGCTTG CCGAATATCA TGGTGGAAAA TGGCCGCTTT TCTGGATTCA 960
TCGACTGTGG CCGGCTGGGT GTGGCGGACC GCTATCAGGA CATAGCGTTG GCTACCCGTG 1020
ATATTGCTGA AGAGCTTGGC GGCGAATGGG CTGACCGCTT CCTCGTGCTT TACGGTATCG 1080
CCGCTCCCGA TTCGCAGCGC ATCGCCTTCT ATCGCCTTCT TGACGAGTTC TTCTGANNNN 1140
NNNNNNNNNN NNNNNNNNNN GATCGTTCAA ACATTTGGCA ATAAAGTTTC TTAAGATTGA 1200
ATCCTGTTGC CGGTCTTGCG ATGATTATCA TATAATTTCT GTTGAATTAC GTTAAGCATG 1260
TAATAATTAA CATGTAATGC ATGACGTTAT TTATGAGATG GGTTTTTATG ATTAGAGTCC 1320
CGCAATTATA CATTTAATAC GCGATAGAAA ACAAAATATA GCGCGCAAAC TAGGATAAAT 1380
TATCGCGCGC GGTGTCATCT ATGTTACTAG ATCGGGCCTC CTGTCAATGC TGGCGGCGGC 1440
TCTGGTGGTG GTTCTGGTGG CGGCTCTGAG GGTGGTGGCT CTGAGGGTGG CGGTTCTGAG 1500
GGTGGCGGCT CTGAGGGAGG CGGTTCCGGT GGTGGCTCTG GTTCCGGTGA TTTTGATTAT 1560
GAAAAGATGG CAAACGCTAA TAAGGGGGCT ATGACCGAAA ATGCCGATGA AAACGCGCTA 1620
CAGTCTGACG CTAAAGGCAA ACTTGATTCT GTCGCTACTG ATTACGGTGC TGCTATCGAT 1680
GGTTTCATTG GTGACGTTTC CGGCCTTGCT AATGGTAATG GTGCTACTGG TGATTTTGCT 1740
GGCTCTAATT CCCAAATGGC TCAAGTCGGT GACGGTGATA ATTCACCTTT AATGAATAAT 1800
TTCCGTCAAT ATTTACCTTC CCTCCCTCAA TCGGTTGAAT GTCGCCCTTT TGTCTTTGGC 1860
CCAATACGCA AACCGGCTCT CCCCGCGCGT TGGCCGATTC ATTAATGCAG CTGGCACGAC 1920
AGGTTTCCCG ACTGGAAAGC GGGCAGTGAG CGCAACGCAA TTAATGTGAG TTAGCTCACT 1980
CATTAGGCAC CCCAGGCTTT ACACTTTATG CTTCCGGCTC GTATGTTGTG TGGAATTGTG 2040
AGCGGATAAC AATTTCACAC AGGAAACAGC TATGACCATG ATTACGCCAA GCTTGCATGC 2100
CTGCAGGTCA CTGGATTTTG GTTTTAGGAA TTAGAAATTT TATTGATAGA AGTATTTTAC 2160
AAATACAAAT ACATACTAAG GGTTTCTTAT ATGCTCAACA CATGAGCGAA ACCCTATAAG 2220
AACCCTAATT CCCTTATCTG GGAACTACTC ACACATTATT CTGGAGAAAA TAGAGAGAGA 2280
TAGATTTGTA GAGAGAGACT GGTGATTTGC GGACTCTAGA GGATCCCCAG CTTTTAAACT 2340
TAGCCAAAGT GGTCTGCCTG ACCAGGAGTT TTTAACCTTA ACCAAAGGGC TGTTCACAGC 2400
TTAGGTTCAT ATATCATAGA ACCGATCATC TCAGATCAGA GGGCTTAAAA GTCTCACAAT 2460
GGGACTTCAC GAGCAAAGCA TCAACTGACG TTAGGCCTCC TCTACCGGTA GCGTAATCGT 2520
CGACCTTCTT TTTCAAGCGT TGTGTGGTCC TACGATCATT AGCTAATTTG AGTGACTCAC 2580
GCTCAAGGGC CTCATGTAAA CGTCCGATCC GTTTGACAGG GAGCTCCTTA GTACTACAGT 2640
CCGAGGAATA AATTCCAATG GTTCTGTAGA CTTTGTCTAA CACACCAGGA AACTTTGGAT 2700
TCTTCCAGTT GTGAAACCAG TCACCATCAG TTTTACGCTC TTCCGTGGTG CGTTTGAACT 2760
TACATACAGG ATCGCTCATC TGATAAACTC TGATGCCTTC GGTACAGTAG CAATCAGAGA 2820
ACCTCAGGAA ATTCTCGGAG TATAAAGAAA AAGCCGCAAG AGCAGCTCTA ACCTCCTCGA 2880
AAATCCAAGG TTTTTCTTTC CCATATTTCA GATAAACAAA ATGACAGAGC GTCGTAATCA 2940
TCTTCTCATC AAGTTGATTA ATAAACTTCA TTCGATCACA GAAGGAAACG AAATGTGCTC 3000
TGAGCATCTG TTCATCACGC AGAATCTTTC GCTTAGCTAA GCGCTGGATC TCTCTCAGAG 3060
GATCTGGTAC AGACACCAAA TTGCCCATTT CAGTTTCGAC GAGAAACTTA CTACAAACGT 3120
AGGGCACACT AGGGTCCATG ACTTTTATCT CCATATTGAA GAGAGACGTA AACATATCGG 3180
TATCCAGGAC TGGCTTAACT TTAGAGATGA TTAAAGAATC ATCTCCTGAA AATATTGCAC 3240
AGTCACAGTC ACTTAGATCA GAGGCATATG CAATCATAGC CATAGTGACA AGAGTATTAC 3300
CGAAATATGT AAACGCGTCA CCAGTTCTGC GTTGGAAGGA AACGGACATT CCCACCTTGG 3360
CATGAGGGTC TGATAAATAA GAATCGCGAT GAAAATCAGA CCACCAATTC GTCAGCGGCG 3420
CTGGAAAGCC CAGCGCAAGG AGTATCTCTC TCTGAAACTC TAGGTGCAGC TCACCCTGAG 3480
ATTTATCAAA TTTGCTTAGG TCCGCTTCAA GAAAGTATCT GTTATTCAAG CGGACATTCT 3540
TAAGCTCCAG AGAGGATATC TTTCCGATAG GCACAATGAA CCTGGATTTC AGGGCCAGTG 3600
ATAACTTCTC GAAACAAGCA GTGAAAAAGG GTGAAAAATT ACTAGTCACA CCTTTACTAT 3660
GAAATGTTAT AGTAGCTGCT ACTGCTCGTT CCAAGTGAAG GGTGTCAGTT ACAACAGGTT 3720
TTACGTCAGA CTTCAGCATA TGCTGGTACC GACATAAATC AGTCTCTGCT GCCACATTCA 3780
CACCTTGCAA GTCCATGTGC TTACCCCACT TCTTATGGTA CTCAAGACAT TTAGTCATGA 3840
CATCCATAGA AGCTCTCAGA CAGTCTTCAC CGTCAACATT AAGGAATGTG CTACGAAAGC 3900
GCTTTGCTAT AGCTTTCGCA GTGTCCTTCA TGTTAATCGC GTCTCCCATT TCTGGAACGT 3960
CCGCGTTTCG CTTTTTGAGT GCGGTTAAGA CTTCTTTCTG AGTACCAACT CTTCGCTGAG 4020
CACTCCCGAT ATTCATTTTT GGTTGAAAAT ATTTATCGGG GTCCCTATAC CAGTCTACAT 4080
CACTTTGCTT AAGTCTGATC CTATCAAAGT CCATGGAATA ATCACCATTT TCAACAAGGG 4140
CTTGATGGTA CGAATCATCG AAATAAGCAT GGGTTGGCAG TATGGAATGA CTGGTCGCTT 4200
CTGTTCTAGC AAGGCTGACT CTCTCCATAT AAATTGGCCC AGTAGAGATG TCAGGGTTAT 4260
CTGGATGGCA GTGTGTATCA ATAACACGCG AAACCCTATG TTCAATAGGG TTCATGATTT 4320
GAAGAGTGAT GTCGTAATCA GTATTAGTAG TCTGAAACTC TTCATCAATG CCCATGTACC 4380
TATCTCCAAG GGTCAGCTCC TTGGGGGTAT CTCCAGTAAC ACGAACTTCC TCAATTTCAC 4440
AGTTCGAGGA ATCACTGGCG AGTTTTAGAT CGCTCGCATG ATCTTCATCG GCGGCAAACG 4500
ATACACCGTA ACCATCACTA GTATCCTCGG GATACCAGTC ATCAATTTCA TCTTCGAGCA 4560
CGAAAGAGCC CGGAATGTCA AGATATAACA TCCGTGCCAT TTCAGCTTGA GGAATCAGCG 4620
GTCTATCGGT GAACTGTTGA ACCATTTGTT GGACGGTGTC GCAAATAGAG CCCCAGCGCA 4680
CTCGGTCAAA AGGGGGATCG AATACCCCTC CTATCTCCAA GGGCGCTATA GCTAATTTAA 4740
AACTCGCGAG AGATCCGTCA ATGGCAACTC CGTCTGCCGG CTCCTGCACC TGAAGGCTAG 4800
CAGCCTCCAC CTCGTCTTCT AAGGATTGAT CTATGATCCA TTGGAAAGAC GGGACCTGGC 4860
GAACGAAATC ATCATCCCAG GTTTTCGAAG ACATCTTGGT GATAGTAGAA AGAACAAGCA 4920
CACAACAACA ACAAGGTCAG ATGTGTGTTG CGGGTACCGA GCTCGAATTC TCGAGGTCCT 4980
CTCCAAATGA AATGAACTTC CTTATATAGA GGAAGGGTCT TGCGAAGGAT AGTGGGATTG 5040
TGCGTCATCC CTTACGTCAG TGGAGATATC ACATCAATCC ACTTGCTTTG AAGACGTGGT 5100
TGGAACGTCT TCTTTTTCCA CGATGTTCCT CGTGGGTGGG GGTCCATCTT TGGGACCACT 5160
GTCGGTAGAG GCATTCTTGA ACGATAGCCT TTCCTTTATC GCAATGATGG CATTTGTAGA 5220
AGCCATCTTC CTTTTCTACT GTCCTTTCGA TGAAGTGACA GATAGCTGGG CAATGGAATC 5280
CGAGGAGGTT TCCCGATATT ACCCTTTGTT GAAAAGTCTC AATAGCCCTC TGGTCTTCTG 5340
AGACTGTATC TTTGATATTC TTGGAGTAGA CGAGAGTGTC GTGCTCCACC ATGTTGACCT 5400
GCAGGCAGCA AGCTTGCATG CCTGCAGGTC GACTCTAGAG GATCCCCGGT CACTGGATTT 5460
TGGTTTTAGG AATTAGAAAT TTTATTGATA GAAGTATTTT ACAAATACAA ATACATACTA 5520
AGGGTTTCTT ATATGCTCAA CACATGAGCG AAACCCTATA AGAACCCTAA TTCCCTTATC 5580
TGGGAACTAC TCACACATTA TTCTGGAGAA AATAGAGAGA GATAGATTTG TAGAGAGAGA 5640
CTGGTGATTT GCGGACTCTA GAGGATCCCC GGGTACCGAG CTCGAATTCT CGAGCAGAGG 5700
TCTCACACAG AGACAAGCGC ATCACTTAAC ACAATTAAAG ATCAAATCAC CAGCGAGCTC 5760
GCCGTTAAAG CAATACTCAA AGGACTTCTT GTGTCGTGTT AAGGCAACCA AACAGTACTC 5820
CTCATGTTTA AACAAATCAC ATTTGGTCGA CTTAAGCCGA ACCAAAGTGA CGTTGTCAAC 5880
AGAGATCCCT TGCGCTTCGT GTACTGTTTT TATGTGTCCA TCAATCCAGT CCTTGCTCAC 5940
GGGAAAATCC TTAGCCCTCG TTTGAAGGGC CGCTTTATCA GCTTGAGTCA TCGTAAGATA 6000
CGTTCTGTTC GGATCAATAG TGACCTGCAA ACCAGAAGTA ATACGACGCT TCGTGAGACT 6060
TCTAGAAACT TTGGACTCAG ATGTCCAGGA TTGATACTTC GTGTCCCTAT TACCGCATTT 6120
ACGCTTCAGC AGATTAACAG CAGCGATAAC ATCTTGCGGA CACCGGTAAG TCTTGTGAAC 6180
AACGTCACGG CGATCATATT GCAGATTACC GTGGAGCAAT TTAAAACCCG CGTCACGAGA 6240
CTTGAACGAA ATCTGCTCTG TGTCCCCAAA GGCAAGAACT TGTGAACATT TAGAGAGAGC 6300
AGCCACCACC AGGAGTTGAC CATAATGTAG TAAACCAGCC TCATCAACAA GCAGCCTATG 6360
ACAGGACGGT ACACCGTGCA TGATCGCAGA ATCCGCGGTG CGCACAACGT CCAAAGCTAC 6420
CTTGGAATTA TAAGTGTCAG GGAATAAAGC CATCCTGACG TCCTCGGCCG ATTTACGATT 6480
CGCCGTCACA ATTAGGTCCT CTCCCATACG GAATGCATCT TTTATGGCAG TGGTTTTACC 6540
GCATCCCGCA ACTCCATCAA CCATGGAAAT ATCGCATGTA GGGACAGAAA CTTTGGCGCT 6600
AGCTTCTGCA ATGTCCCTCA AGTTAGAGCA TGCACATGTT TTATCAACAA TGTACGTTTC 6660
ATCTGCGTGC TTCGGACCTA AACCATGCTC ATTATATCCA ACAGTGTAAT CGTATTTTTT 6720
AGGATACAAC CAGTTACCGT TGGCCAAATG GACATTCACC ATATCGTCTA TGCGATGGTA 6780
GGTCTCAAAG ATGCTCTTAT TTGCGATCTC ACTTCCGCGA CCGCCGGAAA TGTCCCATAG 6840
GTGACGAAGA TTAGACTCGG AGTTGTTATG TAATCTCTTA CAATAACGCA CAAATTCCTT 6900
CATGGCTCCG TGTCTAGATA TGCCACGAGG GTCCGTTGGT ACCTCAACAG ACACCTCGGC 6960
ATCCGGGACC ACATCAGTCA CCGGTTTAAC GTCATCACTG ACGGACTCAG GGCTCGAACT 7020
CTCAGGGGCA TCATGAAACT CCTCCTGAGG TATCTCAGCA GCTGGCGGGA CTTTCGCCTT 7080
CTTCTTCGAG CGCTTGGTCT TGGCTGTCTG CACTTCATGC TCCAGCCGGT CGAATAAGTC 7140
CTCTTCAGTC CAAAACGTTC TCAAACGTGA TATCGGTACA GAATCTTGCT CAAATTCTTC 7200
AACGTTTGAG AGACGAGTCA GAAACTTAAA ACTGTCCGCA TAAGAATCCA GACGTAGTAG 7260
GGGAAATCTG CTAGCCAATG TTCTCAGCCA TCCTACTTTC GCCCTGGATG AATCTCCACC 7320
CCACCAAAAC CTAGTTTTGA AGTGATGGCA CCAACCTTTC CATTCCATCC CATCGCGGAG 7380
GGCCGTAAGC TTTTCGTACT TTTGATACAG ATTCAAAGTC AAAGCAAAGG CCACTAGATG 7440
ATAATCTTCA ATGTCTAAGC GCTCACCAGC CATGATAGCC TGACCGTTAA TAATAACAGT 7500
CGACGACTTG GCGGATAAGA TAGATGCGAC AGCTTTCATG TTCTCAGTCC ATTCTTTACT 7560
TTCCTTGAAA CATCTGAAAG CTATCTCCTC TACCTCTCTC ACTGTGGTTT TGGCGACGCG 7620
CACACATTTC CAGCGATTGA GACTCCAGTC TTCAGGTATT GAGACCCCTA CGTACTTAGA 7680
TATGTCTTCA AACCATACAC AGTGACGTAG TGTCTCCCGG GGGCAGCGTA AATTTGTAGC 7740
GATGATCTTA TAGGTCATGA TGTTACATTT CAGCATTTCG CGCTCCAACA GATAGGTGGT 7800
TCCATCGATG CAATGCACCG ACTCGGTGAA AAATGAGCCC AAATCTTGCC ATCCGTGGAT 7860
GTAAGATAAT GTGCTTTCAT TTTCAAAATC GAATTTGATC ACCTCATCCG CGCCTGACCC 7920
GTCACGTTGC CAGTGACATT TAAGCAAGGG AAGAAAACCC TCGCGGTCAA ACAACATGGC 7980
GCCGTCGAAC ATAACGGTAC CACGTAGTAC GCGTACTCCA TGCGAATGCA TGGCGTCACA 8040
CAGACCTTGG AAGCCCATAT CATAACCGCC GTGGATACAG ATAGCCCAAT CAGCTTGGAC 8100
ATCACAATCT TGAGCTCGGT TAAGACAAAA GTTCGGGACT TCATCGAAAT CATCGCTTTC 8160
TTGCAAAATT TTTCGCATGC GGCACATCCT CTCCTCATGT CGGGCAGCGT CTCTAACACC 8220
CAACACAGGA CAACAACTGT GCACCCTTTT ATCCCTTCTT GAAAAGTGAT GCCACCAAGA 8280
GCCTCCGAAA TCTATAACGG GGTCTTCAGG GGGAAAACTG TCGAGACAGT CATAATGCTC 8340
CGCTACACGC AGAGCACCAG CCAGGCTATG GGGCGCATGA TACTGCTGAG TCAAATTTAA 8400
GTCAAAGGCA CCACCATAAC GGTCACGGAA GGCGTCAGCC TCCTCAATAG AGAGCTTATT 8460
GCGAACGTTG ATTTTCTTAG ACCTTTTCGC GTATTCAATC TGCGCAGATA ACTGTTGCGC 8520
AACCTGATTG TCTACGATGT CTTGGGCACT CTGGCTGTCA GCACCCTTCT CAGCAATCAA 8580
CTTCAGCAAA TCGATAGAAC TTGACATTTT GTTGGTGAAA AACAAAGAAC AAGTAGCAGA 8640
ACCGTGGTCG AGGTCCTCTC CAAATGAAAT GAACTTCCTT ATATAGAGGA AGGGTCTTGC 8700
GAAGGATAGT GGGATTGTGC GTCATCCCTT ACGTCAGTGG AGATATCACA TCAATCCACT 8760
TGCTTTGAAG ACGTGGTTGG AACGTCTTCT TTTTCCACGA TGTTCCTCGT GGGTGGGGGT 8820
CCATCTTTGG GACCACTGTC GGTAGAGGCA TTCTTGAACG ATAGCCTTTC CTTTATCGCA 8880
ATGATGGCAT TTGTAGAAGC CATCTTCCTT TTCTACTGTC CTTTCGATGA AGTGACAGAT 8940
AGCTGGGCAA TGGAATCCGA GGAGGTTTCC CGATATTACC CTTTGTTGAA AAGTCTCAAT 9000
AGCCCTCTGG TCTTCTGAGA CTGTATCTTT GATATTCTTG GAGTAGACGA GAGTGTCGTG 9060
CTCCACCATG TTGACCGGGT GGTCAGTCCC TTATGTTACG TCCTGTAGAA ACCCCAACCC 9120
GTGAAATCAA AAAACTCGAC GGCCTGTGGG CATTCAGTCT GGATCGCGAA AACTGTGGAA 9180
TTGATCAGCG TTGGTGGGAA AGCGCGTTAC AAGAAAGCCG GGCAATTGCT GTGCCAGGCA 9240
GTTTTAACGA TCAGTTCGCC GATGCAGATA TTCGTAATTA TGCGGGCAAC GTCTGGTATC 9300
AGCGCGAAGT CTTTATACCG AAAGGTTGGG CAGGCCAGCG TATCGTGCTG CGTTTCGATG 9360
CGGTCACTCA TTACGGCAAA GTGTGGGTCA ATAATCAGGA AGTGATGGAG CATCAGGGCG 9420
GCTATACGCC ATTTGAAGCC GATGTCACGC CGTATGTTAT TGCCGGGAAA AGTGTACAAT 9480
TCACTGGCCG TCGTTTTACA ACGTCGTGAC TGGGAAAACC CTGGCGTTAC CCAACTTAAT 9540
CGCCTTGCAG CACATCCCCC TTTCGCCAGC TGGCGTAATA GCGAAGAGGC CCGCACCGAT 9600
CGCCCTTCCC AACAGTTGCG CAGCCTGAAT GGCGAATGNN NNNNNAATTC AGTACATTAA 9660
AAACGTCCGC AATGTGTTAT TAAGTTGTCT AAGCGTCAAT TTGTTTACAC CACAATATAT 9720
CCTGCCACCA GCCAGCCAAC AGCTCCCCGA CCGGCAGCTC GGCACAAAAT CACCACTCGA 9780
TACAGGCAGC CCATCAGNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 9840
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 9900
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 9960
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 10020
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 10080
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 10140
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 10200
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 10240


<210> 7 
<211> 10272 
<212> DNA 

<213> Brome mosaic virus 


<400> 7 

AAACACTGAT AGTTTAAACT GAAGGCGGGA AACGACAATC TGATCATGAG CGGAGAATTA 60
AGGGAGTCAC GTTATGACCC CCGCCGATGA CGCGGGACAA GCCGTTTTAC GTTTGGAACT 120
GACAGAACCG CAACGATTGA AGGAGCCACT CAGCCGCGGG TTTCTGGAGT TTAATGAGCT 180
AAGCACATAC GTCAGAAACC ATTATTGCGC GTTCAAAAGT CGCCTAAGGT CACTATCAGC 240
TAGCAAATAT TTCTTGTCAA AAATGCTCCA CTGACGTTCC ATAAATTCCC CTCGGTATCC 300
AATTAGNNNN NNNNNNNNNN NNNNNNNNNN GATCGTTTCG CATGATTGAA CAAGATGGAT 360
TGCACGCAGG TTCTCCGGCC GCTTGGGTGG AGAGGCTATT CGGCTATGAC TGGGCACAAC 420
AGACAATCGG CTGCTCTGAT GCCGCCGTGT TCCGGCTGTC AGCGCAGGGG CGCCCGGTTC 480
TTTTTGTCAA GACCGACCTG TCCGGTGCCC TGAATGAACT GCAGGACGAG GCAGCGCGGC 540
TATCGTGGCT GGCCACGACG GGCGTTCCTT GCGCAGCTGT GCTCGACGTT GTCACTGAAG 600
CGGGAAGGGA CTGGCTGCTA TTGGGCGAAG TGCCGGGGCA GGATCTCCTG TCATCTCACC 660
TTGCTCCTGC CGAGAAAGTA TCCATCATGG CTGATGCAAT GGGGCGGCTG CATACGCTTG 720
ATCCGGCTAC CTGCCCATTC GACCACCAAG CGAAACATCG CATCGAGCGA GCACGTACTC 780
GGATGGAAGC CGGTCTTGTC GATCAGGATG ATCTGGACGA AGAGCATCAG GGGCTCGCGC 840
CAGCCGAACT GTTCGCCAGG CTCAAGGCGC GCATGCCCGA CGGCGATGAT CTCGTCGTGA 900
CCCATGGCGA TGCCTGCTTG CCGAATATCA TGGTGGAAAA TGGCCGCTTT TCTGGATTCA 960
TCGACTGTGG CCGGCTGGGT GTGGCGGACC GCTATCAGGA CATAGCGTTG GCTACCCGTG 1020
ATATTGCTGA AGAGCTTGGC GGCGAATGGG CTGACCGCTT CCTCGTGCTT TACGGTATCG 1080
CCGCTCCCGA TTCGCAGCGC ATCGCCTTCT ATCGCCTTCT TGACGAGTTC TTCTGANNNN 1140
NNNNNNNNNN NNNNNNNNNN GATCGTTCAA ACATTTGGCA ATAAAGTTTC TTAAGATTGA 1200
ATCCTGTTGC CGGTCTTGCG ATGATTATCA TATAATTTCT GTTGAATTAC GTTAAGCATG 1260
TAATAATTAA CATGTAATGC ATGACGTTAT TTATGAGATG GGTTTTTATG ATTAGAGTCC 1320
1695 

CGCAATTATA CATTTAATAC GCGATAGAAA ACAAAATATA GCGCGCAAAC TAGGATAAAT 1380
TATCGCGCGC GGTGTCATCT ATGTTACTAG ATCGGGCCTC CTGTCAATGC TGGCGGCGGC 1440
TCTGGTGGTG GTTCTGGTGG CGGCTCTGAG GGTGGTGGCT CTGAGGGTGG CGGTTCTGAG 1500
GGTGGCGGCT CTGAGGGAGG CGGTTCCGGT GGTGGCTCTG GTTCCGGTGA TTTTGATTAT 1560
GAAAAGATGG CAAACGCTAA TAAGGGGGCT ATGACCGAAA ATGCCGATGA AAACGCGCTA 1620
CAGTCTGACG CTAAAGGCAA ACTTGATTCT GTCGCTACTG ATTACGGTGC TGCTATCGAT 1680
GGTTTCATTG GTGACGTTTC CGGCCTTGCT AATGGTAATG GTGCTACTGG TGATTTTGCT 1740
GGCTCTAATT CCCAAATGGC TCAAGTCGGT GACGGTGATA ATTCACCTTT AATGAATAAT 1800
TTCCGTCAAT ATTTACCTTC CCTCCCTCAA TCGGTTGAAT GTCGCCCTTT TGTCTTTGGC 1860
CCAATACGCA AACCGCCTCT CCCCGCGCGT TGGCCGATTC ATTAATGCAG CTGGCACGAC 1920
AGGTTTCCCG ACTGGAAAGC GGGCAGTGAG CGCAACGCAA TTAATGTGAG TTAGCTCACT 1980
CATTAGGCAC CCCAGGCTTT ACACTTTATG CTTCCGGCTC GTATGTTGTG TGGAATTGTG 2040
AGCGGATAAC AATTTCACAC AGGAAACAGC TATGACCATG ATTACGCCAA GCTTGCTGCC 2100
TGCAGGTCAA CATGGTGGAG CACGACACTC TCGTCTACTC CAAGAATATC AAAGATACAG 2160
TCTCAGAAGA CCAGAGGGCT ATTGAGACTT TTCAACAAAG GGTAATATCG GGAAACCTCC 2220
TCGGATTCCA TTGCCCAGCT ATCTGTCACT TCATCGAAAG GACAGTAGAA AAGGAAGATG 2280
GCTTCTACAA ATGCCATCAT TGCGATAAAG GAAAGGCTAT CGTTCAAGAA TGCCTCTACC 2340
GACAGTGGTC CCAAAGATGG ACCCCCACCC ACGAGGAACA TCGTGGAAAA AGAAGACGTT 2400
CCAACCACGT CTTCAAAGCA AGTGGATTGA TGTGATATCT CCACTGACGT AAGGGATGAC 2460
GCACAATCCC ACTATCCTTC GCAAGACCCT TCCTCTATAT AAGGAAGTTC ATTTCATTTG 2520
GAGAGGACCT CGAGAATTCG AGCTCGGTAC CCGCAACACA CATCTGACCT TGTTGTTGTT 2580
GTGTGCTTGT TCTTTCTACT ATCACCAAGA TGTCTTCGAA AACCTGGGAT GATGATTTCG 2640
TTCGCCAGGT CCCGTCTTTC CAATGGATCA TAGATCAATC CTTAGAAGAC GAGGTGGAGG 2700
CTGCTAGCCT TCAGGTGCAG GAGCCGGCAG ACGGAGTTGC CATTGACGGA TCTCTCGCGA 2760
GTTTTAAATT AGCTATAGCG CCCTTGGAGA TAGGAGGGGT ATTCGATCCC CCTTTTGACC 2820
GAGTGCGCTG GGGCTCTATT TGCGACACCG TCCAACAAAT GGTTCAACAG TTCACCGATA 2880
GACCGCTGAT TCCTCAAGCT GAAATGGCAC GGATGTTATA TCTTGACATT CCGGGCTCTT 2940
TCGTGCTCGA AGATGAAATT GATGACTGGT ATCCCGAGGA TACTAGTGAT GGTTACGGTG 3000
TATCGTTTGC CGCCGATGAA GATCATGCGA GCGATCTAAA ACTCGCCAGT GATTCCTCGA 3060
ACTGTGAAAT TGAGGAAGTT CGTGTTACTG GAGATACCCC CAAGGAGCTG ACCCTTGGAG 3120
ATAGGTACAT GGGCATTGAT GAAGAGTTTC AGACTACTAA TACTGATTAC GACATCACTC 3180
TTCAAATCAT GAACCCTATT GAACATAGGG TTTCGCGTGT TATTGATACA CACTGCCATC 3240
CAGATAACCC TGACATCTCT ACTGGGCCAA TTTATATGGA GAGAGTCAGC CTTGCTAGAA 3300
CAGAAGCGAC CAGTCATTCC ATACTGCCAA CCCATGCTTA TTTCGATGAT TCGTACCATC 3360
AAGCCCTTGT TGAAAATGGT GATTATTCCA TGGACTTTGA TAGGATCAGA CTTAAGCAAA 3420
GTGATGTAGA CTGGTATAGG GACCCCGATA AATATTTTCA ACCAAAAATG AATATCGGGA 3480
GTGCTCAGCG AAGAGTTGGT ACTCAGAAAG AAGTCTTAAC CGCACTCAAA AAGCGAAACG 3540
CGGACGTTCC AGAAATGGGA GACGCGATTA ACATGAAGGA CACTGCGAAA GCTATAGCAA 3600
AGCGCTTTCG TAGCACATTC CTTAATGTTG ACGGTGAAGA CTGTCTGAGA GCTTCTATGG 3660
ATGTCATGAC TAAATGTCTT GAGTACCATA AGAAGTGGGG TAAGCACATG GACTTGCAAG 3720
GTGTGAATGT GGCAGCAGAG ACTGATTTAT GTCGGTACCA GCATATGCTG AAGTCTGACG 3780
TAAAACCTGT TGTAACTGAC ACCCTTCACT TGGAACGAGC AGTAGCAGCT ACTATAACAT 3840
TTCATAGTAA AGGTGTGACT AGTAATTTTT CACCCTTTTT CACTGCTTGT TTCGAGAAGT 3900
TATCACTGGC CCTGAAATCC AGGTTCATTG TGCCTATCGG AAAGATATCC TCTCTGGAGC 3960
TTAAGAATGT CCGCTTGAAT AACAGATACT TTCTTGAAGC GGACCTAAGC AAATTTGATA 4020
AATCTCAGGG TGAGCTGCAC CTAGAGTTTC AGAGAGAGAT ACTCCTTGCG CTGGGCTTTC 4080
CAGCGCCGCT GACGAATTGG TGGTCTGATT TTCATCGCGA TTCTTATTTA TCAGACCCTC 4140
ATGCCAAGGT GGGAATGTCC GTTTCCTTCC AACGCAGAAC TGGTGACGCG TTTACATATT 4200
TCGGTAATAC TCTTGTCACT ATGGCTATGA TTGCATATGC CTCTGATCTA AGTGACTGTG 4260
ACTGTGCAAT ATTTTCAGGA GATGATTCTT TAATCATCTC TAAAGTTAAG CCAGTCCTGG 4320
ATACCGATAT GTTTACGTCT GTCTTCAATA TGGAGATAAA AGTCATGGAC CCTAGTGTGC 4380
CCTACGTTTG TAGTAAGTTT CTCGTCGAAA CTGAAATGGG CAATTTGGTG TCTGTACCAG 4440
ATCCTCTGAG AGAGATCCAG CGCTTAGCTA AGCGAAAGAT TCTGCGTGAT GAACAGATGC 4500
TCAGAGCACA TTTCGTTTCC TTCTGTGATC GAATGAAGTT TATTAATCAA CTTGATGAGA 4560
AGATGATTAC GACGCTCTGT CATTTTGTTT ATCTGAAATA TGGGAAAGAA AAACCTTGGA 4620
TTTTCGAGGA GGTTAGAGCT GCTCTTGCGG CTTTTTCTTT ATACTCCGAG AATTTCCTGA 4680
GGTTCTCTGA TTGCTACTGT ACCGAAGGCA TCAGAGTTTA TCAGATGAGC GATCCTGTAT 4740
GTAAGTTCAA ACGCACCACG GAAGAGCGTA AAACTGATGG TGACTGGTTT CACAACTGGA 4800
AGAATCCAAA GTTTCCTGGT GTGTTAGACA AAGTCTACAG AACCATTGGA ATTTATTCCT 4860
CGGACTGTAG TACTAAGGAG CTCCCTGTCA AACGGATCGG ACGTTTACAT GAGGCCCTTG 4920
AGCGTGAGTC ACTCAAATTA GCTAATGATC GTAGGACCAC ACAACGCTTG AAAAAGAAGG 4980
TCGACGATTA CGCTACCGGT AGAGGAGGCC TAACGTCAGT TGATGCTTTG CTCGTGAAGT 5040
CCCATTGTGA GACTTTTAAG CCCTCTGATC TGAGATGATC GGTTCTATGA TATATGAACC 5100
TAAGCTGTGA ACAGCCCTTT GGTTAAGGTT AAAAACTCCT GGTCAGGCAG ACCACTTTGG 5160
CTAAGTTTAA AAGCTGGGGA TCCTCTAGAG TCCGCAAATC ACCAGTCTCT CTCTACAAAT 5220
CTATCTCTCT CTATTTTCTC CAGAATAATG TGTGAGTAGT TCCCAGATAA GGGAATTAGG 5280
GTTCTTATAG GGTTTCGCTC ATGTGTTGAG CATATAAGAA ACCCTTAGTA TGTATTTGTA 5340
TTTGTAAAAT ACTTCTATCA ATAAAATTTC TAATTCCTAA AACCAAAATC CAGTGACCTG 5400
CAGGCATGCA AGCTTGCATG CCTGCAGGTC GACTCTAGAG GATCCCCGGT CAACATGGTG 5460
GAGCACGACA CTCTCGTCTA CTCCAAGAAT ATCAAAGATA CAGTCTCAGA AGACCAGAGG 5520
GCTATTGAGA CTTTTCAACA AAGGGTAATA TCGGGAAACC TCCTCGGATT CCATTGCCCA 5580
GCTATCTGTC ACTTCATCGA AAGGACAGTA GAAAAGGAAG ATGGCTTCTA CAAATGCCAT 5640
CATTGCGATA AAGGAAAGGC TATCGTTCAA GAATGCCTCT ACCGACAGTG GTCCCAAAGA 5700
TGGACCCCCA CCCACGAGGA ACATCGTGGA AAAAGAAGAC GTTCCAACCA CGTCTTCAAA 5760
GCAAGTGGAT TGATGTGATA TCTCCACTGA CGTAAGGGAT GACGCACAAT CCCACTATCC 5820
TTCGCAAGAC CCTTCCTCTA TATAAGGAAG TTCATTTCAT TTGGAGAGGA CCTCGACCAC 5880
GGTTCTGCTA CTTGTTCTTT GTTTTTCACC AACAAAATGT CAAGTTCTAT CGATTTGCTG 5940
AAGTTGATTG CTGAGAAGGG TGCTGACAGC CAGAGTGCCC AAGACATCGT AGACAATCAG 6000
GTTGCGCAAC AGTTATCTGC GCAGATTGAA TACGCGAAAA GGTCTAAGAA AATCAACGTT 6060
CGCAATAAGC TCTCTATTGA GGAGGCTGAC GCCTTCCGTG ACCGTTATGG TGGTGCCTTT 6120
GACTTAAATT TGACTCAGCA GTATCATGCG CCCCATAGCC TGGCTGGTGC TCTGCGTGTA 6180
GCGGAGCATT ATGACTGTCT CGACAGTTTT CCCCCTGAAG ACCCCGTTAT AGATTTCGGA 6240
GGGTCTTGGT GGCATCACTT TTCAAGAAGG GATAAAAGGG TGCACAGTTG TTGTCCTGTG 6300
TTGGGTGTTA GAGACGCTGC CCGACATGAG GAGAGGATGT GCCGCATGCG AAAAATTTTG 6360
CAAGAAAGCG ATGATTTCGA TGAAGTCCCG AACTTTTGTC TTAACCGAGC TCAAGATTGT 6420
GATGTCCAAG CTGATTGGGC TATCTGTATC CACGGCGGTT ATGATATGGG CTTCCAAGGT 6480
CTGTGTGACG CCATGCATTC GCATGGAGTA CGCGTACTAC GTGGTACCGT TATGTTCGAC 6540
GGCGCCATGT TGTTTGACCG CGAGGGTTTT CTTCCCTTGC TTAAATGTCA CTGGCAACGT 6600
GACGGGTCAG GCGCGGATGA GGTGATCAAA TTCGATTTTG AAAATGAAAG CACATTATCT 6660
TACATCCACG GATGGCAAGA TTTGGGCTCA TTTTTCACCG AGTCGGTGCA TTGCATCGAT 6720
GGAACCACCT ATCTGTTGGA GCGCGAAATG CTGAAATGTA ACATCATGAC CTATAAGATC 6780
ATCGCTAGAA ATTTACGCTG CCCCCGGGAG ACACTACGTC ACTGTGTATG GTTTGAAGAC 6840
ATATCTAAGT ACGTAGGGGT CTCAATACCT GAAGACTGGA GTCTCAATCG CTGGAAATGT 6900
GTGCGCGTCG CCAAAACCAC AGTGAGAGAG GTAGAGGAGA TAGCTTTCAG ATGTTTCAAG 6960
GAAAGTAAAG AATGGACTGA GAACATGAAA GCTGTCGCAT CTATCTTATC CGCCAAGTCG 7020
TCGACTGTTA TTATTAACGG TCAGGCTATC ATGGCTGGTG AGCGCTTAGA CATTGAAGAT 7080
TATCATCTAG TGGCCTTTGC TTTGACTTTG AATCTGTATC AAAAGTACGA AAAGCTTACG 7140
GCCCTCCGCG ATGGGATGGA ATGGAAAGGT TGGTGCCATC ACTTCAAAAC TAGGTTTTGG 7200
TGGGGTGGAG ATTCATCCAG GGCGAAAGTA GGATGGCTGA GAACATTGGC TAGCAGATTT 7260
CCCCTACTAC GTCTGGATTC TTATGCGGAC AGTTTTAAGT TTCTGACTCG TCTCTCAAAC 7320
GTTGAAGAAT TTGAGCAAGA TTCTGTACCG ATATCACGTT TGAGAACGTT TTGGACTGAA 7380
GAGGACTTAT TCGACCGGCT GGAGCATGAA GTGCAGACAG CCAAGACCAA GCGCTCGAAG 7440
AAGAAGGCGA AAGTCCCGCC AGCTGCTGAG ATACCTCAGG AGGAGTTTCA TGATGCCCCT 7500



GAGAGTTCGA GCCCTGAGTC CGTCAGTGAT GACGTTAAAC CGGTGACTGA TGTGGTCCCG 7560
GATGCCGAGG TGTCTGTTGA GGTACCAACG GACCCTCGTG GCATATCTAG ACACGGAGCC 7620
ATGAAGGAAT TTGTGCGTTA TTGTAAGAGA TTACATAACA ACTCCGAGTC TAATCTTCGT 7680
CACCTATGGG ACATTTCCGG CGGTCGCGGA AGTGAGATCG CAAATAAGAG CATCTTTGAG 7740
ACCTACCATC GCATAGACGA TATGGTGAAT GTCCATTTGG CCAACGGTAA CTGGTTGTAT 7800
CCTAAAAAAT ACGATTACAC TGTTGGATAT AATGAGCATG GTTTAGGTCC GAAGCACGCA 7860
GATGAAACGT ACATTGTTGA TAAAACATGT GCATGCTCTA ACTTGAGGGA CATTGCAGAA 7920
GCTAGCGCCA AAGTTTCTGT CCCTACATGC GATATTTCCA TGGTTGATGG AGTTGCGGGA 7980
TGCGGTAAAA CCACTGCCAT AAAAGATGCA TTCCGTATGG GAGAGGACCT AATTGTGACG 8040
GCGAATCGTA AATCGGCCGA GGACGTCAGG ATGGCTTTAT TCCCTGACAC TTATAATTCC 8100
AAGGTAGCTT TGGACGTTGT GCGCACCGCG GATTCTGCGA TCATGCACGG TGTACCGTCC 8160
TGTCATAGGC TGCTTGTTGA TGAGGCTGGT TTACTACATT ATGGTCAACT CCTGGTGGTG 8220
GCTGCTCTGT CTAAATGTTC ACAAGTTCTT GCCTTTGGGG ACACAGAGCA GATTTCGTTC 8280
AAGTCTCGTG ACGCGGGTTT TAAATTGCTC CACGGTAATC TGCAATATGA TCGCCGTGAC 8340
GTTGTTCACA AGACTTACCG GTGTCCGCAA GATGTTATCG CTGCTGTTAA TCTGCTGAAG 8400
CGTAAATGCG GTAATAGGGA CACGAAGTAT CAATCCTGGA CATCTGAGTC CAAAGTTTCT 8460
AGAAGTCTCA CGAAGCGTCG TATTACTTCT GGTTTGCAGG TCACTATTGA TCCGAACAGA 8520
ACGTATCTTA CGATGACTCA AGCTGATAAA GCGGCCCTTC AAACGAGGGC TAAGGATTTT 8580
CCCGTGAGCA AGGACTGGAT TGATGGACAC ATAAAAACAG TACACGAAGC GCAAGGGATC 8640
TCTGTTGACA ACGTCACTTT GGTTCGGCTT AAGTCGACCA AATGTGATTT GTTTAAACAT 8700
GAGGAGTACT GTTTGGTTGC CTTAACACGA CACAAGAAGT CCTTTGAGTA TTGCTTTAAC 8760
GGCGAGCTCG CTGGTGATTT GATCTTTAAT TGTGTTAAGT GATGCGCTTG TCTCTGTGTG 8820
AGACCTCTGC TCGAGAATTC GAGCTCGGTA CCCGGGGATC CTCTAGAGTC CGCAAATCAC 8880
CAGTCTCTCT CTACAAATCT ATCTCTCTCT ATTTTCTCCA GAATAATGTG TGAGTAGTTC 8940
CCAGATAAGG GAATTAGGGT TCTTATAGGG TTTCGCTCAT GTGTTGAGCA TATAAGAAAC 9000
CCTTAGTATG TATTTGTATT TGTAAAATAC TTCTATCAAT AAAATTTCTA ATTCCTAAAA 9060
CCAAAATCCA GTCACCGGGT GGTCAGTCCC TTATGTTACG TCCTGTAGAA ACCCCAACCC 9120
GTGAAATCAA AAAACTCGAC GGCCTGTGGG CATTCAGTCT GGATCGCGAA AACTGTGGAA 9180
TTGATCAGCG TTGGTGGGAA AGCGCGTTAC AAGAAAGCCG GGCAATTGCT GTGCCAGGCA 9240
GTTTTAACGA TCAGTTCGCC GATGCAGATA TTCGTAATTA TGCGGGCAAC GTCTGGTATC 9300
AGCGCGAAGT CTTTATACCG AAAGGTTGGG CAGGCCAGCG TATCGTGCTG CGTTTCGATG 9360
CGGTCACTCA TTACGGCAAA GTGTGGGTCA ATAATCAGGA AGTGATGGAG CATCAGGGCG 9420
GCTATACGCC ATTTGAAGCC GATGTCACGC CGTATGTTAT TGCCGGGAAA AGTGTACAAT 9480
TCACTGGCCG TCGTTTTACA ACGTCGTGAC TGGGAAAACC CTGGCGTTAC CCAACTTAAT 9540
CGCCTTGCAG CACATCCCCC TTTCGCCAGC TGGCGTAATA GCGAAGAGGC CCGCACCGAT 9600
CGCCCTTCCC AACAGTTGCG CAGCCTGAAT GGCGAATGNN NNNNNAATTC AGTACATTAA 9660
AAACGTCCGC AATGTGTTAT TAAGTTGTCT AAGCGTCAAT TTGTTTACAC CACAATATAT 9720
CCTGCCACCA GCCAGCCAAC AGCTCCCCGA CCGGCAGCTC GGCACAAAAT CACCACTCGA 9780
TACAGGCAGC CCATCAGNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 9840
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 9900
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 9960
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 10020
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 10080
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 10140
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 10200
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 10260
NNNNNNNNNN NN 10272

<210> 8 
<211> 10166 
<212> DNA 

<213> Brome mosaic virus 

<400> 8 

AAACACTGAT AGTTTAAACT GAAGGCGGGA AACGACAATC TGATCATGAG CGGAGAATTA 60
AGGGAGTCAC GTTATGACCC CCGCCGATGA CGCGGGACAA GCCGTTTTAC GTTTGGAACT 120
GACAGAACCG CAACGATTGA AGGAGCCACT CAGCCGCGGG TTTCTGGAGT TTAATGAGCT 180
AAGCACATAC GTCAGAAACC ATTATTGCGC GTTCAAAAGT CGCCTAAGGT CACTATCAGC 240
TAGCAAATAT TTCTTGTCAA AAATGCTCCA CTGACGTTCC ATAAATTCCC CTCGGTATCC 300
AATTAGNNNN NNNNNNNNNN NNNNNNNNNN GATCGTTTCG CATGATTGAA CAAGATGGAT 360
TGCACGCAGG TTCTCCGGCC GCTTGGGTGG AGAGGCTATT CGGCTATGAC TGGGCACAAC 420
AGACAATCGG CTGCTCTGAT GCCGCCGTGT TCCGGCTGTC AGCGCAGGGG CGCCCGGTTC 480
TTTTTGTCAA GACCGACCTG TCCGGTGCCC TGAATGAACT GCAGGACGAG GCAGCGCGGC 540
TATCGTGGCT GGCCACGACG GGCGTTCCTT GCGCAGCTGT GCTCGACGTT GTCACTGAAG 600
CGGGAAGGGA CTGGCTGCTA TTGGGCGAAG TGCCGGGGCA GGATCTCCTG TCATCTCACC 660
TTGCTCCTGC CGAGAAAGTA TCCATCATGG CTGATGCAAT GCGGCGGCTG CATACGCTTG 720
ATCCGGCTAC CTGCCCATTC GACCACCAAG CGAAACATCG CATCGAGCGA GCACGTACTC 780
GGATGGAAGC CGGTCTTGTC GATCAGGATG ATCTGGACGA AGAGCATCAG GGGCTCGCGC 840
CAGCCGAACT GTTCGCCAGG CTCAAGGCGC GCATGCCCGA CGGCGATGAT CTCGTCGTGA 900
CCCATGGCGA TGCCTGCTTG CCGAATATCA TGGTGGAAAA TGGCCGCTTT TCTGGATTCA 960
TCGACTGTGG CCGGCTGGGT GTGGCGGACC GCTATCAGGA CATAGCGTTG GCTACCCGTG 1020
ATATTGCTGA AGAGCTTGGC GGCGAATGGG CTGACCGCTT CCTCGTGCTT TACGGTATCG 1080
CCGCTCCCGA TTCGCAGCGC ATCGCCTTCT ATCGCCTTCT TGACGAGTTC TTCTGANNNN 1140
NNNNNNNNNN NNNNNNNNNN GATCGTTCAA ACATTTGGCA ATAAAGTTTC TTAAGATTGA 1200
ATCCTGTTGC CGGTCTTGCG ATGATTATCA TATAATTTCT GTTGAATTAC GTTAAGCATG 1260
TAATAATTAA CATGTAATGC ATGACGTTAT TTATGAGATG GGTTTTTATG ATTAGAGTCC 1320
CGCAATTATA CATTTAATAC GCGATAGAAA ACAAAATATA GCGCGCAAAC TAGGATAAAT 1380
TATCGCGCGC GGTGTCATCT ATGTTACTAG ATCGGGCCTC CTGTCAATGC TGGCGGCGGC 1440
TCTGGTGGTG GTTCTGGTGG CGGCTCTGAG GGTGGTGGCT CTGAGGGTGG CGGTTCTGAG 1500
GGTGGCGGCT CTGAGGGAGG CGGTTCCGGT GGTGGCTCTG GTTCCGGTGA TTTTGATTAT 1560
GAAAAGATGG CAAACGCTAA TAAGGGGGCT ATGACCGAAA ATGCCGATGA AAACGCGCTA 1620
CAGTCTGACG CTAAAGGCAA ACTTGATTCT GTCGCTACTG ATTACGGTGC TGCTATCGAT 1680
GGTTTCATTG GTGACGTTTC CGGCCTTGCT AATGGTAATG GTGCTACTGG TGATTTTGCT 1740
GGCTCTAATT CCCAAATGGC TCAAGTCGGT GACGGTGATA ATTCACCTTT AATGAATAAT 1800
TTCCGTCAAT ATTTACCTTC CCTCCCTCAA TCGGTTGAAT GTCGCCCTTT TGTCTTTGGC 1860
CCAATACGCA AACCGCCTCT CCCCGCGCGT TGGCCGATTC ATTAATGCAG CTGGCACGAC 1920
AGGTTTCCCG ACTGGAAAGC GGGCAGTGAG CGCAACGCAA TTAATGTGAG TTAGCTCACT 1980
CATTAGGCAC CCCAGGCTTT ACACTTTATG CTTCCGGCTC GTATGTTGTG TGGAATTGTG 2040
AGCGGATAAC AATTTCACAC AGGAAACAGC TATGACCATG ATTACGCCAA GCTTGCTGCC 2100
TGCAGGTCAA CATGGTGGAG CACGACACTC TCGTCTACTC CAAGAATATC AAAGATACAG 2160
TCTCAGAAGA CCAGAGGGCT ATTGAGACTT TTCAACAAAG GGTAATATCG GGAAACCTCC 2220
TCGGATTCCA TTGCCCAGCT ATCTGTCACT TCATCGAAAG GACAGTAGAA AAGGAAGATG 2280
GCTTCTACAA ATGCCATCAT TGCGATAAAG GAAAGGCTAT CGTTCAAGAA TGCCTCTACC 2340
GACAGTGGTC CCAAAGATGG ACCCCCACCC ACGAGGAACA TCGTGGAAAA AGAAGACGTT 2400
CCAACCACGT CTTCAAAGCA AGTGGATTGA TGTGATATCT CCACTGACGT AAGGGATGAC 2460
GCACAATCCC ACTATCCTTC GCAAGACCCT TCCTCTATAT AAGGAAGTTC ATTTCATTTG 2520
GAGAGGACCT CGAGAATTCG AGCTCGGTAC CCGCAACACA CATCTGACCT TGTTGTTGTT 2580
GTGTGCTTGT TCTTTCTACT ATCACCAAGA TGTCTTCGAA AACCTGGGAT GATGATTTCG 2640
TTCGCCAGGT CCCGTCTTTC CAATGGATCA TAGATCAATC CTTAGAAGAC GAGGTGGAGG 2700
CTGCTAGCCT TCAGGTGCAG GAGCCGGCAG ACGGAGTTGC CATTGACGGA TCTCTCGCGA 2760
GTTTTAAATT AGCTATAGCG CCCTTGGAGA TAGGAGGGGT ATTCGATCCC CCTTTTGACC 2820
GAGTGCGCTG GGGCTCTATT TGCGACACCG TCCAACAAAT GGTTCAACAG TTCACCGATA 2880
GACCGCTGAT TCCTCAAGCT GAAATGGCAC GGATGTTATA TCTTGACATT CCGGGCTCTT 2940
TCGTGCTCGA AGATGAAATT GATGACTGGT ATCCCGAGGA TACTAGTGAT GGTTACGGTG 3000
TATCGTTTGC CGCCGATGAA GATCATGCGA GCGATCTAAA ACTCGCCAGT GATTCCTCGA 3060
ACTGTGAAAT TGAGGAAGTT CGTGTTACTG GAGATACCCC CAAGGAGCTG ACCCTTGGAG 3120
ATAGGTACAT GGGCATTGAT GAAGAGTTTC AGACTACTAA TACTGATTAC GACATCACTC 3180
TTCAAATCAT GAACCCTATT GAACATAGGG TTTCGCGTGT TATTGATACA CACTGCCATC 3240
CAGATAACCC TGACATCTCT ACTGGGCCAA TTTATATGGA GAGAGTCAGC CTTGCTAGAA 3300
CAGAAGCGAC CAGTCATTCC ATACTGCCAA CCCATGCTTA TTTCGATGAT TCGTACCATC 3360
AAGCCCTTGT TGAAAATGGT GATTATTCCA TGGACTTTGA TAGGATCAGA CTTAAGCAAA 3420
GTGATGTAGA CTGGTATAGG GACCCCGATA AATATTTTCA ACCAAAAATG AATATCGGGA 3480
GTGCTCAGCG AAGAGTTGGT ACTCAGAAAG AAGTCTTAAC CGCACTCAAA AAGCGAAACG 3540
CGGACGTTCC AGAAATGGGA GACGCGATTA ACATGAAGGA CACTGCGAAA GCTATAGCAA 3600
AGCGCTTTCG TAGCACATTC CTTAATGTTG ACGGTGAAGA CTGTCTGAGA GCTTCTATGG 3660
ATGTCATGAC TAAATGTCTT GAGTACCATA AGAAGTGGGG TAAGCACATG GACTTGCAAG 3720
GTGTGAATGT GGCAGCAGAG ACTGATTTAT GTCGGTACCA GCATATGCTG AAGTCTGACG 3780
TAAAACCTGT TGTAACTGAC ACCCTTCACT TGGAACGAGC AGTAGCAGCT ACTATAACAT 3840
TTCATAGTAA AGGTGTGACT AGTAATTTTT CACCCTTTTT CACTGCTTGT TTCGAGAAGT 3900
TATCACTGGC CCTGAAATCC AGGTTCATTG TGCCTATCGG AAAGATATCC TCTCTGGAGC 3960
TTAAGAATGT CCGCTTGAAT AACAGATACT TTCTTGAAGC GGACCTAAGC AAATTTGATA 4020
AATCTCAGGG TGAGCTGCAC CTAGAGTTTC AGAGAGAGAT ACTCCTTGCG CTGGGCTTTC 4080
CAGCGCCGCT GACGAATTGG TGGTCTGATT TTCATCGCGA TTCTTATTTA TCAGACCCTC 4140
ATGCCAAGGT GGGAATGTCC GTTTCCTTCC AACGCAGAAC TGGTGACGCG TTTACATATT 4200
TCGGTAATAC TCTTGTCACT ATGGCTATGA TTGCATATGC CTCTGATCTA AGTGACTGTG 4260
ACTGTGCAAT ATTTTCAGGA GATGATTCTT TAATCATCTC TAAAGTTAAG CCAGTCCTGG 4320
ATACCGATAT GTTTACGTCT CTCTTCAATA TGGAGATAAA AGTCATGGAC CCTAGTGTGC 4380
CCTACGTTTG TAGTAAGTTT CTCGTCGAAA CTGAAATGGG CAATTTGGTG TCTGTACCAG 4440
ATCCTCTGAG AGAGATCCAG CGCTTAGCTA AGCGAAAGAT TCTGCGTGAT GAACAGATGC 4500
TCAGAGCACA TTTCGTTTCC TTCTGTGATC GAATGAAGTT TATTAATCAA CTTGATGAGA 4560
AGATGATTAC GACGCTCTGT CATTTTGTTT ATCTGAAATA TGGGAAAGAA AAACCTTGGA 4620
TTTTCGAGGA GGTTAGAGCT GCTCTTGCGG CTTTTTCTTT ATACTCCGAG AATTTCCTGA 4680
GGTTCTCTGA TTGCTACTGT ACCGAAGGCA TCAGAGTTTA TCAGATGAGC GATCCTGTAT 4740
GTAAGTTCAA ACGCACCACG GAAGAGCGTA AAACTGATGG TGACTGGTTT CACAACTGGA 4800
AGAATCCAAA GTTTCCTGGT GTGTTAGACA AAGTCTACAG AACCATTGGA ATTTATTCCT 4860
CGGACTGTAG TACTAAGGAG CTCCCTGTCA AACGGATCGG ACGTTTACAT GAGGCCCTTG 4920
AGCGTGAGTC ACTCAAATTA GCTAATGATC GTAGGACCAC ACAACGCTTG AAAAAGAAGG 4980
TCGACGATTA CGCTACCGGT AGAGGAGGCC TAACGTCAGT TGATGCTTTG CTCGTGAAGT 5040
CCCATTGTGA GACTTTTAAG CCCTCTCATC TGAGATGATC GGTTCTATGA TATATGAACC 5100
TAAGCTGTGA ACAGCCCTTT GGTTAAGGTT AAAAACTCCT GGTCAGGCAG ACCACTTTGG 5160
CTAAGTTTAA AAGCTGGGGA TCCTCTAGAG TCCGCAAATC ACCAGTCTCT CTCTACAAAT 5220
CTATCTCTCT CTATTTTCTC CAGAATAATG TGTGAGTAGT TCCCAGATAA GGGAATTAGG 5280
GTTCTTATAG GGTTTCGCTC ATGTGTTGAG CATATAAGAA ACCCTTAGTA TGTATTTGTA 5340
TTTGTAAAAT ACTTCTATCA ATAAAATTTC TAATTCCTAA AACCAAAATC CAGTGACCTG 5400
CAGGCATGCA AGCTTGCATG CCTGCAGGTC GACTCTAGAG GATCCCCGGT CACTGGATTT 5460
TGGTTTTAGG AATTAGAAAT TTTATTGATA GAAGTATTTT ACAAATACAA ATACATACTA 5520
AGGGTTTCTT ATATGCTCAA CACATGAGCG AAACCCTATA AGAACCCTAA TTCCCTTATC 5580
TGGGAACTAC TCACACATTA TTCTGGAGAA AATAGAGAGA GATAGATTTG TAGAGAGAGA 5640
CTGGTGATTT GCGGACTCTA GAGGATCCCC GGGTACCGAG CTCGAATTCT CGAGCAGAGG 5700
TCTCACACAG AGACAAGCGC ATCACTTAAC ACAATTAAAG ATCAAATCAC CAGCGAGCTC 5760
GCCGTTAAAG CAATACTCAA AGGACTTCTT GTGTCGTGTT AAGGCAACCA AACAGTACTC 5820
CTCATGTTTA AACAAATCAC ATTTGGTCGA CTTAAGCCGA ACCAAAGTGA CGTTGTCAAC 5880
AGAGATCCCT TGCGCTTCGT GTACTGTTTT TATGTGTCCA TCAATCCAGT CCTTGCTCAC 5940
GGGAAAATCC TTAGCCCTCG TTTGAAGGGC CGCTTTATCA GCTTGAGTCA TCGTAAGATA 6000
CGTTCTGTTC GGATCAATAG TGACCTGCAA ACCAGAAGTA ATACGACGCT TCGTGAGACT 6060
TCTAGAAACT TTGGACTCAG ATGTCCAGGA TTGATACTTC GTGTCCCTAT TACCGCATTT 6120
ACGCTTCAGC AGATTAACAG CAGCGATAAC ATCTTGCGGA CACCGGTAAG TCTTGTGAAC 6180
AACGTCACGG CGATCATATT GCAGATTACC GTGGAGCAAT TTAAAACCCG CGTCACGAGA 6240
CTTGAACGAA ATCTGCTCTG TGTCCCCAAA GGCAAGAACT TGTGAACATT TAGACAGAGC 6300
AGCCACCACC AGGAGTTGAC CATAATGTAG TAAACCAGCC TCATCAACAA GCAGCCTATG 6360
ACAGGACGGT ACACCGTGCA TGATCGCAGA ATCCGCGGTG CGCACAACGT CCAAAGCTAC 6420
CTTGGAATTA TAAGTGTCAG GGAATAAAGC CATCCTGACG TCCTCGGCCG ATTTACGATT 6480
CGCCGTCACA ATTAGGTCCT CTCCCATACG GAATGCATCT TTTATGGCAG TGGTTTTACC 6540
GCATCCCGCA ACTCCATCAA CCATGGAAAT ATCGCATGTA GGGACAGAAA CTTTGGCGCT 6600
AGCTTCTGCA ATGTCCCTCA AGTTAGAGCA TGCACATGTT TTATCAACAA TGTACGTTTC 6660
ATCTGCGTGC TTCGGACCTA AACCATGCTC ATTATATCCA ACAGTGTAAT CGTATTTTTT 6720
AGGATACAAC CAGTTACCGT TGGCCAAATG GACATTCACC ATATCGTCTA TGCGATGGTA 6780
GGTCTCAAAG ATGCTCTTAT TTGCGATCTC ACTTCCGCGA CCGCCGGAAA TGTCCCATAG 6840
GTGACGAAGA TTAGACTCGG AGTTGTTATG TAATCTCTTA CAATAACGCA CAAATTCCTT 6900
CATGGCTCCG TGTCTAGATA TGCCACGAGG GTCCGTTGGT ACCTCAACAG ACACCTCGGC 6960
ATCCGGGACC ACATCAGTCA CCGGTTTAAC GTCATCACTG ACGGACTCAG GGCTCGAACT 7020
CTCAGGGGCA TCATGAAACT CCTCCTGAGG TATCTCAGCA GCTGGCGGGA CTTTCGCCTT 7080
CTTCTTCGAG CGCTTGGTCT TGGCTGTCTG CACTTCATGC TCCAGCCGGT CGAATAAGTC 7140
CTCTTCAGTC CAAAACGTTC TCAAACGTGA TATCGGTACA GAATCTTGCT CAAATTCTTC 7200
AACGTTTGAG AGACGAGTCA GAAACTTAAA ACTGTCCGCA TAAGAATCCA GACGTAGTAG 7260
GGGAAATCTG CTAGCCAATG TTCTCAGCCA TCCTACTTTC GCCCTGGATG AATCTCCACC 7320
CCACCAAAAC CTAGTTTTGA AGTGATGGCA CCAACCTTTC CATTCCATCC CATCGCGGAG 7380
GGCCGTAAGC TTTTCGTACT TTTGATACAG ATTCAAAGTC AAAGCAAAGG CCACTAGATG 7440
ATAATCTTCA ATGTCTAAGC GCTCACCAGC CATGATAGCC TGACCGTTAA TAATAACAGT 7500
CGACGACTTG GCGGATAAGA TAGATGCGAC AGCTTTCATG TTCTCAGTCC ATTCTTTACT 7560
TTCCTTGAAA CATCTGAAAG CTATCTCCTC TACCTCTCTC ACTGTGGTTT TGGCGACGCG 7620
CACACATTTC CAGCGATTGA GACTCCAGTC TTCAGGTATT GAGACCCCTA CGTACTTAGA 7680
TATGTCTTCA AACCATACAC AGTGACGTAG TGTCTCCCGG GGGCAGCGTA AATTTGTAGC 7740
GATGATCTTA TAGGTCATGA TGTTACATTT CAGCATTTCG CGCTCCAACA GATAGGTGGT 7800
TCCATCGATG CAATGCACCG ACTCGGTGAA AAATGAGCCC AAATCTTGCC ATCCGTGGAT 7860
GTAAGATAAT GTGCTTTCAT TTTCAAAATC GAATTTGATC ACCTCATCCG CGCCTGACCC 7920
GTCACGTTGC CAGTGACATT TAAGCAAGGG AAGAAAACCC TCGCGGTCAA ACAACATGGC 7980
GCCGTCGAAC ATAACGGTAC CACGTAGTAC GCGTACTCCA TGCGAATGCA TGGCGTCACA 8040
CAGACCTTGG AAGCCCATAT CATAACCGCC GTGGATACAG ATAGCCCAAT CAGCTTGGAC 8100
ATCACAATCT TGAGCTCGGT TAAGACAAAA GTTCGGGACT TCATCGAAAT CATCGCTTTC 8160
TTGCAAAATT TTTCGCATGC GGCACATCCT CTCCTCATGT CGGGCAGCGT CTCTAACACC 8220
CAACACAGGA CAACAACTGT GCACCCTTTT ATCCCTTCTT GAAAAGTGAT GCCACCAAGA 8280
CCCTCCGAAA TCTATAACGG GGTCTTCAGG GGGAAAACTG TCGAGACAGT CATAATGCTC 8340
CGCTACACGC AGAGCACCAG CCAGGCTATG GGGCGCATGA TACTGCTGAG TCAAATTTAA 8400
GTCAAAGGCA CCACCATAAC GGTCACGGAA GGCGTCAGCC TCCTCAATAG AGAGCTTATT 8460
GCGAACGTTG ATTTTCTTAG ACCTTTTCGC GTATTCAATC TGCGCAGATA ACTGTTGCGC 8520
AACCTGATTG TCTACGATGT CTTGGGCACT CTGGCTGTCA GCACCCTTCT CAGCAATCAA 8580
CTTCAGCAAA TCGATAGAAC TTGACATTTT GTTGGTGAAA AACAAAGAAC AAGTAGCAGA 8640
ACCGTGGTCG AGGTCCTCTC CAAATGAAAT GAACTTCCTT ATATAGAGGA AGGGTCTTGC 8700
GAAGGATAGT GGGATTGTGC GTCATCCCTT ACGTCAGTGG AGATATCACA TCAATCCACT 8760
TGCTTTGAAG ACGTGGTTGG AACGTCTTCT TTTTCCACGA TGTTCCTCGT GGGTGGGGGT 8820
CCATCTTTGG GACCACTGTC GGTAGAGGCA TTCTTGAACG ATAGCCTTTC CTTTATCGCA 8880
ATGATGGCAT TTGTAGAAGC CATCTTCCTT TTCTACTGTC CTTTCGATGA AGTGACAGAT 8940
AGCTGGGCAA TGGAATCCGA GGAGGTTTCC CGATATTACC CTTTGTTGAA AAGTCTCAAT 9000
AGCCCTCTGG TCTTCTGAGA CTGTATCTTT GATATTCTTG GAGTAGACGA GAGTGTCGTG 9060
CTCCACCATG TTGACCGGGT GGTCAGTCCC TTATGTTACG TCCTGTAGAA ACCCCAACCC 9120
GTGAAATCAA AAAACTCGAC GGCCTGTGGG CATTCAGTCT GGATCGCGAA AACTGTGGAA 9180
TTGATCAGCG TTGGTGGGAA AGCGCGTTAC AAGAAAGCCG GGCAATTGCT GTGCCAGGCA 9240
GTTTTAACGA TCAGTTCGCC GATGCAGATA TTCGTAATTA TGCGGGCAAC GTCTGGTATC 9300
AGCGCGAAGT CTTTATACCG AAAGGTTGGG CAGGCCAGCG TATCGTGCTG CGTTTCGATG 9360
CGGTCACTCA TTACGGCAAA GTGTGGGTCA ATAATCAGGA AGTGATGGAG CATCAGGGCG 9420
GCTATACGCC ATTTGAAGCC GATGTCACGC CGTATGTTAT TGCCGGGAAA AGTGTACAAT 9480
TCACTGGCCG TCGTTTTACA ACGTCGTGAC TGGGAAAACC CTGGCGTTAC CCAACTTAAT 9540
CGCCTTGCAG CACATCCCCC TTTCGCCAGC TGGCGTAATA GCGAAGAGGC CCGCACCGAT 9600
CGCCCTTCCC AACAGTTGCG CAGCCTGAAT GGCGAATGNN NNNNNAATTC AGTACATTAA 9660
AAACGTCCGC AATGTGTTAT TAAGTTGTCT AAGCGTCAAT TTGTTTACAC CACAATATAT 9720
CCTGCCACCA GCCAGCCAAC AGCTCCCCGA CCGGCAGCTC GGCACAAAAT CACCACTCGA 9780
TACAGGCAGC CCATCAGNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 9840
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 9900
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 9960
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 10020
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 10080
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 10140
NNNNNNNNNN NNNNNNNNNN NNNNNN 10166